-
Private Superdiversity dataset
The Superdiversity dataset includes the Superdiversity Index (SI) calculated on the diversity of the emotional content expressed in texts of different communities. The... -
Private Origin and destination attachment from Twitter
The cultural integration of immigrants conditions their overall socio-economic integration as well as natives' attitudes towards globalisation in general and immigration in... -
Private Cybersecurity NER BERT-base-cased model
This method includes a Python script and files of a BERT-base-cased model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that... -
Cybersecurity NER RoBERTa-base model
This method includes a Python script and files of a RoBERTa-base model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will...-
JSON
The resource: 'config' is not accessible as guest user. You must login to access it!
-
TXT
The resource: 'merges' is not accessible as guest user. You must login to access it!
-
BIN
The resource: 'model' is not accessible as guest user. You must login to access it!
-
JSON
The resource: 'model_args' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'scheduler' is not accessible as guest user. You must login to access it!
-
JSON
The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
-
JSON
The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'training_args' is not accessible as guest user. You must login to access it!
-
JSON
The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
-
JSON
The resource: 'vocab' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'optimizer' is not accessible as guest user. You must login to access it!
-
py
The resource: 'inference' is not accessible as guest user. You must login to access it!
-
JSON
-
Cybersecurity NER SecureBERT model
This method includes a Python script and files of a SecureBERT model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will be...-
JSON
The resource: 'config' is not accessible as guest user. You must login to access it!
-
TXT
The resource: 'merges' is not accessible as guest user. You must login to access it!
-
BIN
The resource: 'model' is not accessible as guest user. You must login to access it!
-
JSON
The resource: 'model_args' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'optimizer' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'scheduler' is not accessible as guest user. You must login to access it!
-
JSON
The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
-
JSON
The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
-
JSON
The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
-
ZIP
The resource: 'training_args' is not accessible as guest user. You must login to access it!
-
TXT
The resource: 'vocab' is not accessible as guest user. You must login to access it!
-
text/x-python
The resource: 'inference' is not accessible as guest user. You must login to access it!
-
JSON
-
EMAKG: Enhanced Microsoft Academic Knowledge Graph
The EMAKG is a large dataset of scientific publications and related entities such as authors, affiliations, venues, and fields of study. Data includes authors' careers and... -
Integrating Direct Intracranial Stimulation with the Human Connectome
Cortical and subcortical direct electrical stimulation (DES) coordinates in MNI space, anonymized patients’ demographic data, and aggregated functional maps for the 12... -
Private ltlf2asp
Linear Temporal Logic over Finite Traces (LTLf) is a popular logic to reason about finite sequences of events. In LTLf, the (bounded) satisfiability problem refers to whether... -
Human and mouse gene regulatory networks
The dataset was built by considering gene expression data related to 6 different organs (liver, lung, brain, skin, bone marrow, heart), obtained by control samples available... -
Stroke and sepsi
The considered stroke dataset (DOI:10.17632/x8ygrw87jw.1, DOI:10.1016/j.artmed.2019.101723) was pre-processed by removing attributes with more than 30% missing values, by... -
EUR-Lex MOSTA
This dataset contains 4176 non-empty official public EU legal judgments that were finalized between 2008 and 2018, categorized in one or more subject matters, that fall within...-
ZIP
The resource: 'EUR-Lex MOSTA' is not accessible as guest user. You must login to access it!
-
ZIP
-
Private Vegetation of a basin of the Po river Dataset
We provide two climatological dataset composed by D = 136 (with 1038 samples) and D = 1991 (with 981 samples) continuous climatological features and a scalar target, which... -
Private Dynamical Linear Upper Confidence Bound (DynLin-UCB)
The repository contains the code to run DynLin-UCB (Dynamical Linear Upper Confidence Bound). DynLin-UCB is an optimistic regret-minimization algorithm that can be used to... -
Private EnviroStream
This repository contains datasets, queries and a generator for the EnviroStream, a benchmark for Stream Reasoning (SR) systems. SR focuses on applying inference to dynamic... -
Private Environmental Monitoring of Fluorescence Response
We study a novel sequential decision-making setting, namely the dissimilarity bandits. At each round, the learner pulls an arm that provides a stochastic d-dimensional... -
Italian Common Procurement Vocabulary (CPV)
This dataset contains 5M pairs of Italian tender descriptions and the corresponding Common Procurement Vocabulary (CPV) code. The data are downloaded from the ANAC website...-
ZIP
The resource: '10007545' is not accessible as guest user. You must login to access it!
-
ZIP
-
dolly-15k-it
This dataset is obtained by automatically translating the dolly 15k dataset. The dolly-15k dataset is an open-source dataset of instruction-following records generated by...-
jsonl
The resource: 'dolly-15k-it' is not accessible as guest user. You must login to access it!
-
jsonl
-
EVALITA 2020 HT
This dataset is obtained by transforming the training and test data of the two EVALITA tasks into an LLM prompt following a template. The tasks involved are AMI2020 (misogyny...-
ZIP
The resource: 'EVALITA_2020_bloom_it' is not accessible as guest user. You must login to access it!
-
ZIP
-
EnviroStream (Benchmark)
Stream Reasoning (SR) focuses on developing advanced approaches for applying inference to dynamic data streams; it has become increasingly relevant in various application... -
Private Battery State of Health in smart grids Dataset
Smart Grids are the evolution of traditional electric grids and allow two-way flows of electricity and information between different actors. At the edge of this network,...