-
Private Superdiversity dataset
The Superdiversity dataset includes the Superdiversity Index (SI) calculated on the diversity of the emotional content expressed in texts of different communities. The... -
Private Origin and destination attachment from Twitter
The cultural integration of immigrants conditions their overall socio-economic integration as well as natives' attitudes towards globalisation in general and immigration in... -
Private Air Traffic Data International Mobility Indicators for the UK
The Air Traffic Data International Mobility Indicators for the UK results from the investigation on air passenger data. Starting from air passenger traffic volumes from each... -
EMAKG: Enhanced Microsoft Academic Knowledge Graph
The EMAKG is a large dataset of scientific publications and related entities such as authors, affiliations, venues, and fields of study. Data includes authors' careers and... -
Integrating Direct Intracranial Stimulation with the Human Connectome
Cortical and subcortical direct electrical stimulation (DES) coordinates in MNI space, anonymized patients’ demographic data, and aggregated functional maps for the 12... -
Human and mouse gene regulatory networks
The dataset was built by considering gene expression data related to 6 different organs (liver, lung, brain, skin, bone marrow, heart), obtained by control samples available... -
Stroke and sepsi
The considered stroke dataset (DOI:10.17632/x8ygrw87jw.1, DOI:10.1016/j.artmed.2019.101723) was pre-processed by removing attributes with more than 30% missing values, by... -
Bark Beetle Outbreak Czech Republic
Repository containing satellite dataset created for bark beetle outbreak detection in satellite (Sentinel-1 and Sentinel-2) images. The dataset refer to scenes observed in... -
EUR-Lex MOSTA
This dataset contains 4176 non-empty official public EU legal judgments that were finalized between 2008 and 2018, categorized in one or more subject matters, that fall within...-
ZIP
The resource: 'EUR-Lex MOSTA' is not accessible as guest user. You must login to access it!
-
ZIP
-
Private Vegetation of a basin of the Po river Dataset
We provide two climatological dataset composed by D = 136 (with 1038 samples) and D = 1991 (with 981 samples) continuous climatological features and a scalar target, which... -
Private EnviroStream
This repository contains datasets, queries and a generator for the EnviroStream, a benchmark for Stream Reasoning (SR) systems. SR focuses on applying inference to dynamic... -
Italian Common Procurement Vocabulary (CPV)
This dataset contains 5M pairs of Italian tender descriptions and the corresponding Common Procurement Vocabulary (CPV) code. The data are downloaded from the ANAC website...-
ZIP
The resource: '10007545' is not accessible as guest user. You must login to access it!
-
ZIP
-
dolly-15k-it
This dataset is obtained by automatically translating the dolly 15k dataset. The dolly-15k dataset is an open-source dataset of instruction-following records generated by...-
jsonl
The resource: 'dolly-15k-it' is not accessible as guest user. You must login to access it!
-
jsonl
-
EVALITA 2020 HT
This dataset is obtained by transforming the training and test data of the two EVALITA tasks into an LLM prompt following a template. The tasks involved are AMI2020 (misogyny...-
ZIP
The resource: 'EVALITA_2020_bloom_it' is not accessible as guest user. You must login to access it!
-
ZIP
-
Private Battery State of Health in smart grids Dataset
Smart Grids are the evolution of traditional electric grids and allow two-way flows of electricity and information between different actors. At the edge of this network,... -
Supporting data for "CoVEffect: Interactive System for Mining the Effects of ...
This repository contains the datasets created and extracted for the paper: Giuseppe Serna García, Ruba Al Khalaf, Francesco Invernici, Stefano Ceri, and Anna Bernasconi. 2022.... -
Private Highway driving simulation
The SUMO simulator is used to model scenarios with diferent road topologies and traffc intensities, randomizing the fow of vehicles, to ensure the generation of sufciently... -
Private Cybersecurity NER dataset
Our dataset is created by merging APTNER and CyNER datasets, containing 13601 sentences, 347779 tokens, and 37684 entities. The split ratio was roughly 70% for training and... -
Know your trees dataset
A set of images of urban trees in Tortona specifically focusing on images of trees, leaves, bark and habits along with general information, taxonomy, and selected biometric...-
ZIP
The resource: 'Dataset Know Your Trees ...' is not accessible as guest user. You must login to access it!
-
ZIP
-
Private Twitter users retweet
The dataset was collected using the tweepy API (http://docs.tweepy.org), a Python library for accessing the Twitter API. We selected 14 Twitter accounts, and we obtained all...