-
Private Superdiversity dataset
The Superdiversity dataset includes the Superdiversity Index (SI) calculated on the diversity of the emotional content expressed in texts of different communities. The... -
Private Origin and destination attachment from Twitter
The cultural integration of immigrants conditions their overall socio-economic integration as well as natives' attitudes towards globalisation in general and immigration in... -
EMAKG: Enhanced Microsoft Academic Knowledge Graph
The EMAKG is a large dataset of scientific publications and related entities such as authors, affiliations, venues, and fields of study. Data includes authors' careers and... -
Integrating Direct Intracranial Stimulation with the Human Connectome
Cortical and subcortical direct electrical stimulation (DES) coordinates in MNI space, anonymized patients’ demographic data, and aggregated functional maps for the 12... -
Human and mouse gene regulatory networks
The dataset was built by considering gene expression data related to 6 different organs (liver, lung, brain, skin, bone marrow, heart), obtained by control samples available... -
Stroke and sepsi
The considered stroke dataset (DOI:10.17632/x8ygrw87jw.1, DOI:10.1016/j.artmed.2019.101723) was pre-processed by removing attributes with more than 30% missing values, by... -
EUR-Lex MOSTA
This dataset contains 4176 non-empty official public EU legal judgments that were finalized between 2008 and 2018, categorized in one or more subject matters, that fall within...-
ZIP
The resource: 'EUR-Lex MOSTA' is not accessible as guest user. You must login to access it!
-
ZIP
-
Private Vegetation of a basin of the Po river Dataset
We provide two climatological dataset composed by D = 136 (with 1038 samples) and D = 1991 (with 981 samples) continuous climatological features and a scalar target, which... -
Private EnviroStream
This repository contains datasets, queries and a generator for the EnviroStream, a benchmark for Stream Reasoning (SR) systems. SR focuses on applying inference to dynamic... -
Italian Common Procurement Vocabulary (CPV)
This dataset contains 5M pairs of Italian tender descriptions and the corresponding Common Procurement Vocabulary (CPV) code. The data are downloaded from the ANAC website...-
ZIP
The resource: '10007545' is not accessible as guest user. You must login to access it!
-
ZIP
-
dolly-15k-it
This dataset is obtained by automatically translating the dolly 15k dataset. The dolly-15k dataset is an open-source dataset of instruction-following records generated by...-
jsonl
The resource: 'dolly-15k-it' is not accessible as guest user. You must login to access it!
-
jsonl
-
EVALITA 2020 HT
This dataset is obtained by transforming the training and test data of the two EVALITA tasks into an LLM prompt following a template. The tasks involved are AMI2020 (misogyny...-
ZIP
The resource: 'EVALITA_2020_bloom_it' is not accessible as guest user. You must login to access it!
-
ZIP
-
Private Battery State of Health in smart grids Dataset
Smart Grids are the evolution of traditional electric grids and allow two-way flows of electricity and information between different actors. At the edge of this network,... -
Supporting data for "CoVEffect: Interactive System for Mining the Effects of ...
This repository contains the datasets created and extracted for the paper: Giuseppe Serna García, Ruba Al Khalaf, Francesco Invernici, Stefano Ceri, and Anna Bernasconi. 2022.... -
Private Highway driving simulation
The SUMO simulator is used to model scenarios with diferent road topologies and traffc intensities, randomizing the fow of vehicles, to ensure the generation of sufciently... -
Private Cybersecurity NER dataset
Our dataset is created by merging APTNER and CyNER datasets, containing 13601 sentences, 347779 tokens, and 37684 entities. The split ratio was roughly 70% for training and... -
Know your trees dataset
A set of images of urban trees in Tortona specifically focusing on images of trees, leaves, bark and habits along with general information, taxonomy, and selected biometric...-
ZIP
The resource: 'Dataset Know Your Trees ...' is not accessible as guest user. You must login to access it!
-
ZIP
-
Private Twitter users retweet
The dataset was collected using the tweepy API (http://docs.tweepy.org), a Python library for accessing the Twitter API. We selected 14 Twitter accounts, and we obtained all... -
Private Protein-Ligand Interaction Graphs for Affinity Studies
The dataset contains a clean version of the data retrieved from PDBBind in the work of Volkov et al. (2022) that can be used for machine learning-based studies for compound... -
Private A Decade of Reddit Politics: Comprehensive Dataset on User Political Leanings...
"A Decade of Reddit Politics: Comprehensive Dataset on User Political Leanings and Interaction Networks (2011-2021)" is a comprehensive dataset containing a 10 years long...