Aspect-based Sentiment Classification Model Employing Dialect Normalization and Deep Learning

Dahou, Abdelghani / Supervisor

Please use this identifier to cite or link to this item: https://dspace.univ-adrar.edu.dz/jspui/handle/123456789/7924

Title:	Aspect-based Sentiment Classification Model Employing Dialect Normalization and Deep Learning
Authors:	Dahou, Abdelghani / Supervisor
Keywords:	sentiment analysis aspect based sentiment analysis dialect normalization aspect term polarity aspect category polaity تحليل المشاعر ،تحليل المشاعر القائم على الجانب ، تطبيع اللهجة ، تحديد قطبية مصطلح الجانب ، تحديد مصطلح فئة الجانب Degree in Computer Science
Issue Date:	2022
Publisher:	University Ahmed DRAIA of Adrar
Abstract:	Sentiment analysis (SA) is one of the most important fields of natural language processing (NLP) due to its wide range of applications and the benefits associated with using it. It is defined as the process of identifying the sentiment polarity (positive, negative, or neutral) of a natural language given text. Researchers have recently focused their attention on Arabic SA due to the massive amounts of user-generated content on social media and e-commerce websites in the Arab world. Most of the research in this fieldwork is on the sentence and document levels. This study tackles the aspect-level sentiment analysis for the Arabic language, which is a less studied version of SA. Because Arabic NLP is challenging, few Arabic annotated corpora available, and the massive number of Arabic dialects, limited studies have attempted to detect aspect-based sentiment analysis on Arabic texts. Specifically, this study considers two ABSA tasks: aspect term polarity and aspect category polarity, and using the text normalization of the Arabic dialect after making the classification task. We present a Seq2Seq model for dialect normalization that can serve as a pre-processing step for the ABSA classification task by reducing the number of OOV words. Thus, the model's accuracy increased. The results of the conducted experiments show that our models outperformed the existing models in literature on both tasks and datasets Sentiment analysis (SA) is one of the most important fields of natural language processing (NLP) due to its wide range of applications and the benefits associated with using it. It is defined as the process of identifying the sentiment polarity (positive, negative, or neutral) of a natural language given text. Researchers have recently focused their attention on Arabic SA due to the massive amounts of user-generated content on social media and e-commerce websites in the Arab world. Most of the research in this fieldwork is on the sentence and document levels. This study tackles the aspect-level sentiment analysis for the Arabic language, which is a less studied version of SA. Because Arabic NLP is challenging, few Arabic annotated corpora available, and the massive number of Arabic dialects, limited studies have attempted to detect aspect-based sentiment analysis on Arabic texts. Specifically, this study considers two ABSA tasks: aspect term polarity and aspect category polarity, and using the text normalization of the Arabic dialect after making the classification task. We present a Seq2Seq model for dialect normalization that can serve as a pre-processing step for the ABSA classification task by reducing the number of OOV words. Thus, the model's accuracy increased. The results of the conducted experiments show that our models outperformed the existing models in literature on both tasks and datasets يعد تحليل المشاعر أحد أهم مجالات معالجة اللغة الطبيعية نظرًا لتطبيقاته الواسعة النطاق والفوائد المرتبطة باستخدامه. يتم تعريفه على أنه عملية تحديد قطبية المشاعر (إيجابية أو سلبية أو محايدة) لنص معطى بلغة طبيعية. ركز الباحثون مؤخرًا على الاهتمام بتحليل المشاعر العربية بسبب الكميات الهائلة من المحتويات التي ينشئها المستخدمون على مواقع التواصل الاجتماعي ومواقع التجارة الإلكترونية في العالم العربي. تعمل معظم الأبحاث في هذا المجال على مستويي الجملة والوثيقة. تتناول هذه الدراسة تحليل المشاعر على مستوى الجانب للغة العربية وهي النسخة الأقل دراسة في مجال تحليل المشاعر. نظرًا لأن البرمجة اللغوية العصبية للغة العربية يمثل تحديًا والعدد القليل من مجموعة البيانات العربية المتاحة والعدد الهائل من اللهجات العربية ، فقد حاولت دراسات محدودة تناول موضوع تحليل المشاعر القائم على الجوانب في النصوص العربية. على وجه التحديد ، تعالج هذه الدراسة في مهمتين من مهام تحليل المشاعر القائم على الجانب: تحديد قطبية مصطلح الجانب و تحديد قطبية فئة الجانب ، وتستفيد من تطبيع النص في اللهجة العربية بعد إجراء مهمة التصنيف. بتعبير أدق ، نقدم نموذج تطبيع اللهجات والذي يمكن أن يكون بمثابة خطوة معالجة مسبقة لمهمة تصنيف تحليل المشاعر القائم على الجانب عن طريق تقليل عدد الكلمات الخارجة عن قاموس مفردات النموذج وبالتالي زيادة دقة النماذج الخاصة بنا. تظهر النتائج المحققة للتجارب التي تم إجراؤها أن نماذجنا تفوقت على النماذج الموجودة في الدراسات السابقة في كل من المهام ومجموعات البيانات.
Description:	Degree in Computer Science
URI:	https://dspace.univ-adrar.edu.dz/jspui/handle/123456789/7924
Appears in Collections:	Mémoires de Master

Files in This Item:

File	Description	Size	Format
Aspect_based_sentiment_analisys_Clasification_Model_employinf_dialect.pdf		1.15 MB	Adobe PDF	View/Open

Show full item record

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets