Institutional Scholarship

Sentiment Analysis of Egyptian Arabic in Social Media

Show simple item record

dc.contributor.advisor Kumar, Deepak
dc.contributor.advisor Darwish, Manar
dc.contributor.author Abdalkader, Mohamed
dc.date.accessioned 2014-12-01T15:49:00Z
dc.date.available 2014-12-01T15:49:00Z
dc.date.issued 2014
dc.identifier.uri http://hdl.handle.net/10066/15080
dc.description.abstract Sentiment analysis is an emerging area of application fueled by the increase of public participation in online social media. Much work has been done on sentiment analysis in English while less work has been done on other languages like Mandarin and Arabic. Arabic is spoken by hundreds of millions of people in over twenty countries. Modern Standard Arabic (MSA) is used online mostly by newspapers and other official sources. However, social media and blogs used by individuals are typically in Dialect Arabic (DA). My Senior Thesis work has been focused on exploring ways to increase the accuracy of automated sentiment analysis in Egyptian Arabic through using the specific features of Arabic. I found that the baseline algorithm makes the most mistakes in classifying tweets that carry a sentiment as neutral tweets. Using Minimum Edit Distance (MED) and ISRI Arabic stemmer, I was able to decrease the error of the baseline algorithm by 31% without having to add any new entries to the lexicon. My approach has allowed me to not only get over the challenge of different morphological forms but also misspelling and informal writing. While I cannot empirically compare it to results by other authors as I am using a different data set, my approach reaches an accuracy of 78% which has an improvement of 14.7% over the baseline.
dc.description.sponsorship Haverford College. Department of Computer Science
dc.language.iso eng
dc.rights.uri http://creativecommons.org/licenses/by-nc/3.0/us/
dc.subject.lcsh Natural language processing (Computer science)
dc.subject.lcsh Computational linguistics
dc.subject.lcsh Public opinion -- Data processing
dc.subject.lcsh Data mining
dc.subject.lcsh Egyptian language -- Computer programs
dc.subject.lcsh Egyptian language -- Data processing
dc.title Sentiment Analysis of Egyptian Arabic in Social Media
dc.type Thesis
dc.rights.access Haverford users only


Files in this item

This item appears in the following Collection(s)

Show simple item record

http://creativecommons.org/licenses/by-nc/3.0/us/ Except where otherwise noted, this item's license is described as http://creativecommons.org/licenses/by-nc/3.0/us/

Search


Browse

My Account