期刊文献+
共找到7篇文章
< 1 >
每页显示 20 50 100
多任务学习在不良言论与个体特征检测中的应用
1
作者 肖博健 曹霑懋 许莉芬 《计算机系统应用》 2024年第7期74-83,共10页
多任务学习在自然语言处理领域有广泛应用,但多任务模型往往对任务间的相关性比较敏感.如果任务相关性较低或信息传递不合理,可能会严重影响任务性能.本文提出了一种新的共享-私有结构的多任务学习模型BB-MTL(BERT-BiLSTM multi-task le... 多任务学习在自然语言处理领域有广泛应用,但多任务模型往往对任务间的相关性比较敏感.如果任务相关性较低或信息传递不合理,可能会严重影响任务性能.本文提出了一种新的共享-私有结构的多任务学习模型BB-MTL(BERT-BiLSTM multi-task learning model),并借助元学习的思想为其设计了一种特殊的参数优化方式MLL-TM(meta-learning-like train methods).进一步引入一个新的信息融合门SoWLG(Softmax weighted linear gate),用于选择性地融合每项任务的共享特征与私有特征.实验验证所提出的多任务学习方法,考虑到用户在网络上的行为与其个体特征密切相关,文中结合了不良言论检测、人格检测和情绪检测任务进行了一系列实验.实验结果表明,BB-MTL能够有效学习相关任务中的特征信息,在3项任务上的准确率分别达到了81.56%、77.09%和70.82%. 展开更多
关键词 多任务学习 信息融合 不良言论检测 人格检测 情绪检测
下载PDF
Chaotic Elephant Herd Optimization with Machine Learning for Arabic Hate Speech Detection
2
作者 Badriyya B.Al-onazi Jaber S.Alzahrani +5 位作者 Najm Alotaibi Hussain Alshahrani Mohamed Ahmed Elfaki Radwa Marzouk Heba Mohsen Abdelwahed Motwakel 《Intelligent Automation & Soft Computing》 2024年第3期567-583,共17页
In recent years,the usage of social networking sites has considerably increased in the Arab world.It has empowered individuals to express their opinions,especially in politics.Furthermore,various organizations that op... In recent years,the usage of social networking sites has considerably increased in the Arab world.It has empowered individuals to express their opinions,especially in politics.Furthermore,various organizations that operate in the Arab countries have embraced social media in their day-to-day business activities at different scales.This is attributed to business owners’understanding of social media’s importance for business development.However,the Arabic morphology is too complicated to understand due to the availability of nearly 10,000 roots and more than 900 patterns that act as the basis for verbs and nouns.Hate speech over online social networking sites turns out to be a worldwide issue that reduces the cohesion of civil societies.In this background,the current study develops a Chaotic Elephant Herd Optimization with Machine Learning for Hate Speech Detection(CEHOML-HSD)model in the context of the Arabic language.The presented CEHOML-HSD model majorly concentrates on identifying and categorising the Arabic text into hate speech and normal.To attain this,the CEHOML-HSD model follows different sub-processes as discussed herewith.At the initial stage,the CEHOML-HSD model undergoes data pre-processing with the help of the TF-IDF vectorizer.Secondly,the Support Vector Machine(SVM)model is utilized to detect and classify the hate speech texts made in the Arabic language.Lastly,the CEHO approach is employed for fine-tuning the parameters involved in SVM.This CEHO approach is developed by combining the chaotic functions with the classical EHO algorithm.The design of the CEHO algorithm for parameter tuning shows the novelty of the work.A widespread experimental analysis was executed to validate the enhanced performance of the proposed CEHOML-HSD approach.The comparative study outcomes established the supremacy of the proposed CEHOML-HSD model over other approaches. 展开更多
关键词 Arabic language machine learning elephant herd optimization TF-IDF vectorizer hate speech detection
下载PDF
Comparing Fine-Tuning, Zero and Few-Shot Strategies with Large Language Models in Hate Speech Detection in English
3
作者 Ronghao Pan JoséAntonio García-Díaz Rafael Valencia-García 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第9期2849-2868,共20页
Large Language Models(LLMs)are increasingly demonstrating their ability to understand natural language and solve complex tasks,especially through text generation.One of the relevant capabilities is contextual learning... Large Language Models(LLMs)are increasingly demonstrating their ability to understand natural language and solve complex tasks,especially through text generation.One of the relevant capabilities is contextual learning,which involves the ability to receive instructions in natural language or task demonstrations to generate expected outputs for test instances without the need for additional training or gradient updates.In recent years,the popularity of social networking has provided a medium through which some users can engage in offensive and harmful online behavior.In this study,we investigate the ability of different LLMs,ranging from zero-shot and few-shot learning to fine-tuning.Our experiments show that LLMs can identify sexist and hateful online texts using zero-shot and few-shot approaches through information retrieval.Furthermore,it is found that the encoder-decoder model called Zephyr achieves the best results with the fine-tuning approach,scoring 86.811%on the Explainable Detection of Online Sexism(EDOS)test-set and 57.453%on the Multilingual Detection of Hate Speech Against Immigrants and Women in Twitter(HatEval)test-set.Finally,it is confirmed that the evaluated models perform well in hate text detection,as they beat the best result in the HatEval task leaderboard.The error analysis shows that contextual learning had difficulty distinguishing between types of hate speech and figurative language.However,the fine-tuned approach tends to produce many false positives. 展开更多
关键词 hate speech detection zero-shot few-shot fine-tuning natural language processing
下载PDF
基于谐音干扰词替换的中文仇恨言论检测方法
4
作者 王琰慧 王小龙 +2 位作者 张顺香 周渝皓 汪才钦 《应用科技》 CAS 2024年第3期72-81,共10页
社交网络中的仇恨言论常含有形式多变的谐音干扰词,使得现有方法难以适应此现象,不能满足即时检测的要求。针对此问题,提出一种基于谐音干扰词替换的中文仇恨言论检测方法,提取原义词替换谐音干扰词,解决原有方法处理相对滞后问题。首先... 社交网络中的仇恨言论常含有形式多变的谐音干扰词,使得现有方法难以适应此现象,不能满足即时检测的要求。针对此问题,提出一种基于谐音干扰词替换的中文仇恨言论检测方法,提取原义词替换谐音干扰词,解决原有方法处理相对滞后问题。首先,对文本预处理,通过N-gram提取干扰词候选项,并利用点间互信息和邻接熵进行过滤;然后,计算拼音相似度筛选出谐音干扰词及其对应的候选原义词,通过语法结构和上下文语义相似确定原义词并对相应谐音干扰词进行替换,将替换后的文本作为分类层输入;最后,使用RoBERTa-wmm-ext得到语义特征,并通过Softmax计算仇恨情感倾向以实现检测任务。在数据集上进行实验,结果表明提出的模型有效地提升中文仇恨言论的检测效果。 展开更多
关键词 仇恨言论检测 谐音干扰词 拼音相似 语法结构 上下文语义 RoBERTa-wmm-ext CNN N-GRAM
下载PDF
An Adaptive Hate Speech Detection Approach Using Neutrosophic Neural Networks for Social Media Forensics
5
作者 Yasmine M.Ibrahim Reem Essameldin Saad M.Darwish 《Computers, Materials & Continua》 SCIE EI 2024年第4期243-262,共20页
Detecting hate speech automatically in social media forensics has emerged as a highly challenging task due tothe complex nature of language used in such platforms. Currently, several methods exist for classifying hate... Detecting hate speech automatically in social media forensics has emerged as a highly challenging task due tothe complex nature of language used in such platforms. Currently, several methods exist for classifying hatespeech, but they still suffer from ambiguity when differentiating between hateful and offensive content and theyalso lack accuracy. The work suggested in this paper uses a combination of the Whale Optimization Algorithm(WOA) and Particle Swarm Optimization (PSO) to adjust the weights of two Multi-Layer Perceptron (MLPs)for neutrosophic sets classification. During the training process of the MLP, the WOA is employed to exploreand determine the optimal set of weights. The PSO algorithm adjusts the weights to optimize the performanceof the MLP as fine-tuning. Additionally, in this approach, two separate MLP models are employed. One MLPis dedicated to predicting degrees of truth membership, while the other MLP focuses on predicting degrees offalse membership. The difference between these memberships quantifies uncertainty, indicating the degree ofindeterminacy in predictions. The experimental results indicate the superior performance of our model comparedto previous work when evaluated on the Davidson dataset. 展开更多
关键词 hate speech detection whale optimization neutrosophic sets social media forensics
下载PDF
A Review of Machine Learning Techniques in Cyberbullying Detection 被引量:1
6
作者 Daniyar Sultan Batyrkhan Omarov +5 位作者 Zhazira Kozhamkulova Gulnur Kazbekova Laura Alimzhanova Aigul Dautbayeva Yernar Zholdassov Rustam Abdrakhmanov 《Computers, Materials & Continua》 SCIE EI 2023年第3期5625-5640,共16页
Automatic identification of cyberbullying is a problem that is gaining traction,especially in the Machine Learning areas.Not only is it complicated,but it has also become a pressing necessity,considering how social me... Automatic identification of cyberbullying is a problem that is gaining traction,especially in the Machine Learning areas.Not only is it complicated,but it has also become a pressing necessity,considering how social media has become an integral part of adolescents’lives and how serious the impacts of cyberbullying and online harassment can be,particularly among teenagers.This paper contains a systematic literature review of modern strategies,machine learning methods,and technical means for detecting cyberbullying and the aggressive command of an individual in the information space of the Internet.We undertake an in-depth review of 13 papers from four scientific databases.The article provides an overview of scientific literature to analyze the problem of cyberbullying detection from the point of view of machine learning and natural language processing.In this review,we consider a cyberbullying detection framework on social media platforms,which includes data collection,data processing,feature selection,feature extraction,and the application ofmachine learning to classify whether texts contain cyberbullying or not.This article seeks to guide future research on this topic toward a more consistent perspective with the phenomenon’s description and depiction,allowing future solutions to be more practical and effective. 展开更多
关键词 CYBERBULLYING hate speech digital drama online harassment detection classification machine learning NLP
下载PDF
Hate speech detection in Twitter using hybrid embeddings and improved cuckoo search-based neural networks 被引量:5
7
作者 Femi Emmanuel Ayo Olusegun Folorunso +1 位作者 Friday Thomas Ibharalu Idowu Ademola Osinuga 《International Journal of Intelligent Computing and Cybernetics》 EI 2020年第4期485-525,共41页
Purpose-Hate speech is an expression of intense hatred.Twitter has become a popular analytical tool for the prediction and monitoring of abusive behaviors.Hate speech detection with social media data has witnessed spe... Purpose-Hate speech is an expression of intense hatred.Twitter has become a popular analytical tool for the prediction and monitoring of abusive behaviors.Hate speech detection with social media data has witnessed special research attention in recent studies,hence,the need to design a generic metadata architecture and efficient feature extraction technique to enhance hate speech detection.Design/methodology/approach-This study proposes a hybrid embeddings enhanced with a topic inference method and an improved cuckoo search neural network for hate speech detection in Twitter data.The proposed method uses a hybrid embeddings technique that includes Term Frequency-Inverse Document Frequency(TF-IDF)for word-level feature extraction and Long Short Term Memory(LSTM)which is a variant of recurrent neural networks architecture for sentence-level feature extraction.The extracted features from the hybrid embeddings then serve as input into the improved cuckoo search neural network for the prediction of a tweet as hate speech,offensive language or neither.Findings-The proposed method showed better results when tested on the collected Twitter datasets compared to other related methods.In order to validate the performances of the proposed method,t-test and post hoc multiple comparisons were used to compare the significance and means of the proposed method with other related methods for hate speech detection.Furthermore,Paired Sample t-Test was also conducted to validate the performances of the proposed method with other related methods.Research limitations/implications-Finally,the evaluation results showed that the proposed method outperforms other related methods with mean F1-score of 91.3.Originality/value-The main novelty of this study is the use of an automatic topic spotting measure based on na€ıve Bayes model to improve features representation. 展开更多
关键词 TWITTER hate speech detection EMBEDDINGS Cuckoo search Neural networks
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部