105 items found

Organisations: SoBigData Services and Products Availability: On-Line Types: Dataset

Filter Results
  • Dataset

    Multi-aspect Integrated Migration Indicators (MIMI) dataset

    The Multi-aspect Integrated Migration Indicators (MIMI) dataset is a new dataset to be exploited in migration studies as a concrete example of this new approach. It includes...
    • HTML
      The resource: 'Multi-aspect Integrated ...' is not accessible as guest user. You must login to access it!
    • The resource: 'Link to scientific article.' is not accessible as guest user. You must login to access it!
  • Dataset

    Italian Common Procurement Vocabulary (CPV)

    This dataset contains 5M pairs of Italian tender descriptions and the corresponding Common Procurement Vocabulary (CPV) code. The data are downloaded from the ANAC website...
    • ZIP
      The resource: '10007545' is not accessible as guest user. You must login to access it!
  • Dataset

    dolly-15k-it

    This dataset is obtained by automatically translating the dolly 15k dataset. The dolly-15k dataset is an open-source dataset of instruction-following records generated by...
    • jsonl
      The resource: 'dolly-15k-it' is not accessible as guest user. You must login to access it!
  • Dataset

    EVALITA 2020 HT

    This dataset is obtained by transforming the training and test data of the two EVALITA tasks into an LLM prompt following a template. The tasks involved are AMI2020 (misogyny...
    • ZIP
      The resource: 'EVALITA_2020_bloom_it' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Highway driving simulation

    The SUMO simulator is used to model scenarios with diferent road topologies and traffc intensities, randomizing the fow of vehicles, to ensure the generation of sufciently...
  • Access required...

    ×

    Dataset

    Private Cybersecurity NER dataset

    Our dataset is created by merging APTNER and CyNER datasets, containing 13601 sentences, 347779 tokens, and 37684 entities. The split ratio was roughly 70% for training and...
  • Dataset

    Know your trees dataset

    A set of images of urban trees in Tortona specifically focusing on images of trees, leaves, bark and habits along with general information, taxonomy, and selected biometric...
    • ZIP
      The resource: 'Dataset Know Your Trees ...' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Twitter users retweet

    The dataset was collected using the tweepy API (http://docs.tweepy.org), a Python library for accessing the Twitter API. We selected 14 Twitter accounts, and we obtained all...
  • Access required...

    ×

    Dataset

    Private Protein-Ligand Interaction Graphs for Affinity Studies

    The dataset contains a clean version of the data retrieved from PDBBind in the work of Volkov et al. (2022) that can be used for machine learning-based studies for compound...
  • Access required...

    ×

    Dataset

    Private A Decade of Reddit Politics: Comprehensive Dataset on User Political Leanings...

    "A Decade of Reddit Politics: Comprehensive Dataset on User Political Leanings and Interaction Networks (2011-2021)" is a comprehensive dataset containing a 10 years long...
  • Dataset

    Gene Disease Association Data and Features

    This dataset contains data that can be used for disease gene discovery purposes. The data cover ten different diseases with associated seed genes (derived from DisGeNET) and...
    • RAR
      The resource: 'Gene_Disease_Association_Da ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Twitter EURO2020: BLM debate in Italy

    Twitter Dataset for "Will You Take the Knee? Italian Twitter Echo Chambers' Genesis During EURO 2020" The dataset is comprised of the following files:...
    • JSON
      The resource: 'Twitter EURO2020' is not accessible as guest user. You must login to access it!
  • Dataset

    Reddit Echo Chamber dataset

    In a digital environment, the term echo chamber refers to an alarming phenomenon in which beliefs are amplified or reinforced by communication repetition inside a closed...
    • ZIP
      The resource: 'Reddit Echochamber' is not accessible as guest user. You must login to access it!
  • Dataset

    Brexit dataset

    This dataset comprises a set of online footprints extracted from Twitter using the available APIs. It is centered around the Brexit debate on Twitter from the 2nd until the...
    • RAR
      The resource: 'BrexitDataset' is not accessible as guest user. You must login to access it!
  • Dataset

    Multi-sensor dataset of environmental outdoor home conditions

    The Multi-sensor dataset of environmental conditions in smart home consists of time series data acquired from sensors deployed in outdoor of a smart domestic room located in...
    • The resource: 'IoT_dataset_outdoor_smart_home' is not accessible as guest user. You must login to access it!
  • Dataset

    Multi-sensor dataset of environmental indoor home conditions

    The Multi-sensor dataset of environmental conditions in smart home consists of time series data acquired from sensors deployed in indoor of a smart domestic room located in...
    • RAR
      The resource: 'IoT_dataset_indoor_smart_home' is not accessible as guest user. You must login to access it!
  • Dataset

    Multi-sensor dataset of environmental office room conditions

    The Multi-sensor dataset of environmental conditions in smart office consists of time series data acquired from sensors deployed in smart office rooms located in ICAR-CNR, for...
    • RAR
      The resource: 'IoT_dataset_smart_office' is not accessible as guest user. You must login to access it!
  • Dataset

    Multi-sensor dataset of environmental conditions in smart office

    The Multi-sensor dataset of environmental conditions in smart office consists of time series data acquired from sensors deployed in a smart office located in the ICAR CNR IoT...
    • RAR
      The resource: 'Laboratorio IoT' is not accessible as guest user. You must login to access it!
  • Dataset

    User preference-interest dataset

    The User preference-interest dataset is a comprehensive collection of preferences generated by a sequence of 6 regimes following the rules below: - initially, we have...
    • The resource: 'User preference-interest ...' is not accessible as guest user. You must login to access it!
  • Dataset

    RAN and NWDAF data from Cellular Network in Catania

    Dataset containing various RAN and UEs metrics collected from 4 BSs deployed at Piazza D'Uomo, Catania. Metrics can be used for machine learning-based studies for physical...
    • The resource: 'Dataset' is not accessible as guest user. You must login to access it!