Analysis of Text Corpuses From Linguistic Aspects Using Python
dc.contributor.advisor | Tóth, Erzsébet | |
dc.contributor.author | Khalid, Abdullah | |
dc.contributor.department | DE--Faculty of Informatics | |
dc.date.accessioned | 2024-06-23T18:33:24Z | |
dc.date.available | 2024-06-23T18:33:24Z | |
dc.date.created | 2024-04-16 | |
dc.description.abstract | Natural Language Processing (NLP) tasks such as sentiment analysis and text classification have gained popularity as techniques for estimating investor sentiment towards a particular stock. This study performs a stock sentiment analysis using textual data from Reddit related to the TSLA stock. It employs a range of machine learning and deep learning algorithms to train a classifier for multi-class classification of text sentiment, incorporating novel techniques for sentiment reproduction. Additionally, the study investigates the impact of data preprocessing techniques such as undersampling and tokenization on the performance of the sentiment analysis model, providing insights into the effectiveness of these methods in handling imbalanced textual data. The research also explores the influence of machine learning on discerning stock market sentiment and interprets its ramifications for investor behavior and market dynamics. | |
dc.description.course | Computer Science (Programtervező informatikus) | |
dc.description.degree | BSc/BA | |
dc.format.extent | 46 | |
dc.identifier.uri | https://hdl.handle.net/2437/374610 | |
dc.language.iso | en | |
dc.rights.access | Accessible pursuant to the December 2022 amendment to the Higher Education Act. | |
dc.subject | Natural Language Processing (NLP) | |
dc.subject | Sentiment Analysis | |
dc.subject | Multi-Class Text Classification | |
dc.subject.dspace | Informatics::Computer Science | |
dc.title | Analysis of Text Corpuses From Linguistic Aspects Using Python |
Files
Original bundle
- Name: thesis.pdf
- Size: 2.11 MB
- Format: Adobe Portable Document Format
- Description: thesis
License bundle
- Name: license.txt
- Size: 1.95 KB
- Format: Item-specific license agreed upon to submission