-
Private Cybersecurity NER dataset
Our dataset was created by merging the APTNER and CyNER datasets and contains 13,601 sentences, 347,779 tokens, and 37,684 entities. The split ratio was roughly 70% for training and...
-
Synthetic Datasets for Fine-Grained Fairness Analysis of Abusive Language Det...
Three synthetic datasets covering different types of bias grouped by target, namely sexism, racism and ableism. The reason for distinguishing the records by abuse targets is...
-
CSV
-
DNA 12-mers
A 179 MB dataset containing all the ~14M unique 12-mers in the DNA sequences available in the Pizza&Chili Corpus (https://pizzachili.dcc.uchile.cl/texts.html). This dataset...
-
ZIP
-
FANCY Dataset
(NLI) FANCY (FActivity, Negation, Common-sense, hYpernimy) is a new dataset with 4000 sentence pairs concerning complex linguistic phenomena such as factivity, negation,...
-
Santorini Tweets July-August 2021
This dataset contains 225,501 tweets written by 141,277 users. These tweets are geolocated in Santorini, or they contain the word or the hashtag "santorini" in the text. They...
-
ZIP
-
Lexical networks from Lithuanian news articles
The dataset includes lexical networks centered on keywords related to migration. The networks are built starting from Lithuanian news articles extracted from the dataset...
-
jsonl
-
Lexical networks from Swedish news articles
The dataset includes lexical networks centered on keywords related to migration. The networks are built starting from Swedish news articles extracted from the dataset described...
-
jsonl
-
Lexical networks from Croatian news articles
The dataset includes lexical networks centered on keywords related to migration. The networks are built starting from Croatian news articles extracted from the dataset...
-
jsonl
-
Lexical networks from Finnish news articles
The dataset includes lexical networks centered on keywords related to migration. The networks are built starting from Finnish news articles extracted from the dataset...
-
jsonl
-
Semantic Networks from news articles (Romanian sample)
The Semantic Networks from news articles (Romanian sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
-
CSV
-
Semantic Networks from news articles (Dutch sample)
The Semantic Networks from news articles (Dutch sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
-
CSV
-
Semantic Networks from news articles (German sample)
The Semantic Networks from news articles (German sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
-
CSV
-
Semantic Networks from news articles (Portuguese sample)
The Semantic Networks from news articles (Portuguese sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
-
CSV
-
Semantic Networks from news articles (Spanish sample)
The Semantic Networks from news articles (Spanish sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
-
CSV
-
Semantic Networks from news articles (French sample)
The Semantic Networks from news articles (French sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
-
CSV
-
Semantic Networks from news articles (Italian sample)
The Semantic Networks from news articles (Italian sample) contains semantic networks for a sample of migration-related news articles extracted from the dataset described in...
-
CSV
-
Gene-specific regularization for COPD partial-correlation estimation
We introduce a gene-specific regularization factor when computing the Partial Correlation score to make the indeterminate regression feasible. We decided to slightly modify...
-
BioTAGME: A comprehensive platform for biological knowledge network analysis
This network was built through BioTAGME, a system that combines TAGME, an entity-annotation framework based on the Wikipedia corpus, with a network-based inference methodology (i.e.,...
-
Emergency Tweets 2016 Amatrice earthquake
This dataset contains Italian tweets related to the 2016 earthquake in central Italy (https://it.wikipedia.org/wiki/Terremoto_del_Centro_Italia_del_2016_e_d...). is...
-
ZIP
-
Emergency Tweets 2013 Sardinia flood
This dataset is related to the floods that occurred in the Sardinia regional district between 17 and 19 November 2013 (https://en.wikipedia.org/wiki/2013_Sardinia_floods), as...
-
ZIP