77 items found

Licenses: Academic Free License 3.0 Groups: sobigdata-it

Filter Results
  • Access required...

    ×

    Dataset

    Private Superdiversity dataset

    The Superdiversity dataset includes the Superdiversity Index (SI) calculated on the diversity of the emotional content expressed in texts of different communities. The...
  • Access required...

    ×

    Dataset

    Private Origin and destination attachment from Twitter

    The cultural integration of immigrants conditions their overall socio-economic integration as well as natives' attitudes towards globalisation in general and immigration in...
  • JournalArticle

    Where do migrants and natives belong in a community: a Twitter case study and...

    Today, many users are actively using Twitter to express their opinions and to share information. Thanks to the availability of the data, researchers have studied behaviours...
    • The resource: 'Link to paper' is not accessible as guest user. You must login to access it!
  • BookChapter

    Twitter data for migration studies

    Handbook of using Twitter to study migration.
    • The resource: 'Link to chapter.' is not accessible as guest user. You must login to access it!
  • JournalArticle

    Origin and destination attachment: study of cultural integration on Twitter

    The cultural integration of immigrants conditions their overall socio-economic integration as well as natives’ attitudes towards globalisation in general and immigration in...
    • HTML
      The resource: 'Link to article.' is not accessible as guest user. You must login to access it!
  • JournalArticle

    Measuring the Salad Bowl: Superdiversity on Twitter

    Superdiversity refers to large cultural diversity in a population due to immigration. In this paper, we introduce a superdiversity index based on the changes in the emotional...
    • The resource: 'Link to paper.' is not accessible as guest user. You must login to access it!
  • JournalArticle

    International mobility between the UK and Europe around Brexit: a data-driven...

    Among the multiple effects of Brexit, changes in migration and mobility across Europe were expected. Several studies have analysed these aspects, mostly from the point of view...
    • The resource: 'Link to paper.' is not accessible as guest user. You must login to access it!
  • BookChapter

    How Can Big Data Analytics Help Understand Migrant Integration?

    Adequate data are key for evidence-based policymaking. However, while a large amount of official statistics is produced across European Union member States, only a small part...
    • The resource: 'Link to handbook.' is not accessible as guest user. You must login to access it!
  • ConferencePaper

    Digital footprints of international migration on twitter

    Studying migration using traditional data has some limitations. To date, there have been several studies proposing innovative methodologies to measure migration stocks and...
    • The resource: 'Link to paper.' is not accessible as guest user. You must login to access it!
  • JournalArticle

    Combining Twitter and Mobile Phone Data to Observe Border-Rush: The Turkish-E...

    Following Turkey's 2020 decision to revoke border controls, many individuals journeyed towards the Greek, Bulgarian, and Turkish borders. However, the lack of verifiable...
    • The resource: 'Link to article.' is not accessible as guest user. You must login to access it!
  • ConferencePaper

    Characterising different communities of Twitter users: migrants and natives

    Today, many users are actively using Twitter to express their opinions and to share information. Thanks to the availability of the data, researchers have studied behaviours...
    • The resource: 'Link to paper.' is not accessible as guest user. You must login to access it!
  • JournalArticle

    Academic mobility from a big data perspective

    Understanding the careers and movements of highly skilled people plays an ever-increasing role in today’s global knowledge-based economy. Researchers and academics are sources...
    • The resource: 'Link to paper.' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Method

    Private Cybersecurity NER BERT-base-cased model

    This method includes a Python script and files of a BERT-base-cased model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that...
  • Method

    Cybersecurity NER RoBERTa-base model

    This method includes a Python script and files of a RoBERTa-base model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will...
    • JSON
      The resource: 'config' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'merges' is not accessible as guest user. You must login to access it!
    • BIN
      The resource: 'model' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'model_args' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'scheduler' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'training_args' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'vocab' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'optimizer' is not accessible as guest user. You must login to access it!
    • py
      The resource: 'inference' is not accessible as guest user. You must login to access it!
  • Method

    Cybersecurity NER SecureBERT model

    This method includes a Python script and files of a SecureBERT model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will be...
    • JSON
      The resource: 'config' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'merges' is not accessible as guest user. You must login to access it!
    • BIN
      The resource: 'model' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'model_args' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'optimizer' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'scheduler' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'training_args' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'vocab' is not accessible as guest user. You must login to access it!
    • text/x-python
      The resource: 'inference' is not accessible as guest user. You must login to access it!
  • Dataset

    EMAKG: Enhanced Microsoft Academic Knowledge Graph

    The EMAKG is a large dataset of scientific publications and related entities such as authors, affiliations, venues, and fields of study. Data includes authors' careers and...
    • The resource: 'Link to dataset.' is not accessible as guest user. You must login to access it!
    • The resource: 'Link to scientific ...' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Method

    Private ltlf2asp

    Linear Temporal Logic over Finite Traces (LTLf) is a popular logic to reason about finite sequences of events. In LTLf, the (bounded) satisfiability problem refers to whether...
  • Dataset

    Human and mouse gene regulatory networks

    The dataset was built by considering gene expression data related to 6 different organs (liver, lung, brain, skin, bone marrow, heart), obtained by control samples available...
    • The resource: 'Human and mouse gene ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Stroke and sepsi

    The considered stroke dataset (DOI:10.17632/x8ygrw87jw.1, DOI:10.1016/j.artmed.2019.101723) was pre-processed by removing attributes with more than 30% missing values, by...
    • The resource: 'Stroke and sepsi' is not accessible as guest user. You must login to access it!
  • Access required...

    ×

    Dataset

    Private Cybersecurity NER dataset

    Our dataset is created by merging APTNER and CyNER datasets, containing 13601 sentences, 347779 tokens, and 37684 entities. The split ratio was roughly 70% for training and...