Detection of spam-posting accounts on Twitter

NLPIR SEMINAR Y2019#14

INTRO

This semester, our lab, the Web Search Mining and Security Lab, plans to hold an academic seminar every Monday. At each seminar, a keynote speaker will share their understanding of papers related to their research.

Arrangement

This week’s seminar is organized as follows:

The seminar will be held at 1 p.m. on Monday at Zhongguancun Technology Park, Building 5, 1306.
Asif will introduce the paper under review.
Nihad will introduce the paper under review.
The lecturer is Ilham; the paper’s title is Detection of spam-posting accounts on Twitter.
The seminar will be hosted by Zhaoyou Liu.
The paper for this seminar is attached; please download it in advance.

Everyone interested in this topic is welcome to join us. The following is the abstract for this week’s paper.

Detection of spam-posting accounts on Twitter

Isa Inuwa-Dutse, Mark Liptrott, Ioannis Korkontzelos

Abstract

Online Social Media platforms, such as Facebook and Twitter, enable all users, independently of their characteristics, to freely generate and consume huge amounts of data. While this data is being exploited by individuals and organisations to gain competitive advantage, a substantial amount of data is being generated by spam or fake users. One in every 200 social media messages and one in every 21 tweets is estimated to be spam. The rapid growth in the volume of global spam is expected to compromise research works that use social media data, thereby questioning data credibility. Motivated by the need to identify and filter out spam contents in social media data, this study presents a novel approach for distinguishing spam vs. non-spam social media posts and offers more insight into the behaviour of spam users on Twitter. The approach proposes an optimised set of features independent of historical tweets, which are only available for a short time on Twitter. We take into account features related to the users of Twitter, their accounts and their pairwise engagement with each other. We experimentally demonstrate the efficacy and robustness of our approach and compare it to a typical feature set for spam detection in the literature, achieving a significant improvement on performance. In contrast to prior research findings, we observe that an average automated spam account posted at least 12 tweets per day at well defined periods. Our method is suitable for real-time deployment in a social media data collection pipeline as an initial preprocessing strategy to improve the validity of research data.

Detection-of-spam-posting-accounts-on-Twitter (download)




About the Author: nlpvv
