Others - SoBigData.eu Catalogue

Method

Cybersecurity NER BERT-base-cased model (Private)

This method includes a Python script and files of a BERT-base-cased model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that...

Method

Cybersecurity NER RoBERTa-base model

This method includes a Python script and files of a RoBERTa-base model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will...

Resources (login required): config (JSON), merges (TXT), model (BIN), model_args (JSON), scheduler (ZIP), special_tokens_map (JSON), tokenizer_config (JSON), training_args (ZIP), tokenizer (JSON), vocab (JSON), optimizer (ZIP), inference (py)

Dataset

FANCY Dataset

(NLI) FANCY (FActivity, Negation, Common-sense, hYpernimy) is a new dataset with 4000 sentence pairs concerning complex linguistic phenomena such as factivity, negation,...

Resources (login required): FANCY Dataset

Dataset

Santorini Tweets July-August 2021

This dataset contains 225,501 tweets written by 141,277 users. These tweets are geolocated in Santorini, or they contain the word or the hashtag "santorini" in the text. They...

Resources (login required): tweet_santorini.csv (ZIP)

Method

Distributed W2V (Private)

Accelerated training of word embeddings for large text corpora. Creates a word2vec model from an input corpus of tokenized texts through the use of parallel distributed...

Dataset

The Italian Music Dataset

The dataset is built using the Spotify and SoundCloud APIs. It comprises over 14,500 different songs by both famous and less famous Italian musicians. Each song...

Resources (login required): Dataset (JSON)

Dataset

Conversational search dataset with labels

The CAsT 2019 data is split into two files, one for training and one for testing. - Training set: CAsT 2019 conversations from the training set and from the test set without...

Resources (login required): Conversational dataset ...

Dataset

Dataset for Evaluating Abstractive Summaries of Crisis-Related Social Media

This dataset was created to evaluate summaries generated from social media posts published during five natural disasters. The dataset contains: ground truth reports created by human...

Resources (login required): Dataset for Evaluating ...

Method

Ecology of the digital world of Wikipedia (Private)

Wikipedia, a paradigmatic example of an online knowledge space, is organized in a collaborative, bottom-up way with voluntary contributions, yet it maintains a level of reliability...

9 items found
