A prototype deep learning paraphrase identification service for discovering information cascades in social networks

  • Panagiotis Kasnesis
  • Ryan Heartfield
  • Lazaros Toumanidis
  • Xing Liang
  • George Loukas
  • Charalampos Patrikakis

    Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-review

    Abstract

    Identifying the provenance of information posted on social media and how this information may have changed over time can be very helpful in assessing its trustworthiness. Here, we introduce a novel mechanism for discovering "post-based" information cascades, including the earliest relevant post and how its information has evolved over subsequent posts. Our prototype leverages multiple innovations in the combination of dynamic data sub-sampling and multiple natural language processing and analysis techniques, benefiting from deep learning architectures. We evaluate its performance on EMTD, a dataset that we have generated from our private experimental instance of the decentralised social network Mastodon, as well as the benchmark Microsoft Research Paraphrase Corpus, reporting no errors in sub-sampling based on clustering, and an average accuracy of 92% and F1 score of 93% for paraphrase identification.
    Original language: English
    Title of host publication: IEEE International Conference on Multimedia and Expo Workshops (ICMEW) 2020. Piscataway, U.S.: Institute of Electrical and Electronics Engineers, Inc. ISBN 9781728114859. This work was supported by EUNOMIA project [Grant Number: 825171]. Organising Body: Institute of Electrical and Electronics Engineers (IEEE)
    Publication status: Published - 7 Jul 2020

    Bibliographical note

    Note: Published in: IEEE International Conference on Multimedia and Expo Workshops (ICMEW) 2020. Piscataway, U.S. : Institute of Electrical and Electronics Engineers, Inc. ISBN 9781728114859

    This work was supported by EUNOMIA project [Grant Number: 825171].

    Organising Body: Institute of Electrical and Electronics Engineers (IEEE)

    Keywords

    • Information cascade
    • Clustering
    • Deep learning
    • Paraphrase Identification
    • Computer science and informatics
