<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Sneh’s Substack: Publications]]></title><description><![CDATA[This section features my research contributions and publications, highlighting work in machine learning, reinforcement learning, and data-driven systems.]]></description><link>https://www.snehvora.me/s/publications</link><image><url>https://substackcdn.com/image/fetch/$s_!9m8J!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9157932e-985e-4b66-8b94-e3258376ea5c_1280x1280.png</url><title>Sneh’s Substack: Publications</title><link>https://www.snehvora.me/s/publications</link></image><generator>Substack</generator><lastBuildDate>Thu, 30 Apr 2026 18:35:48 GMT</lastBuildDate><atom:link href="https://www.snehvora.me/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Sneh Vora]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[snehvora@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[snehvora@substack.com]]></itunes:email><itunes:name><![CDATA[Sneh Vora]]></itunes:name></itunes:owner><itunes:author><![CDATA[Sneh Vora]]></itunes:author><googleplay:owner><![CDATA[snehvora@substack.com]]></googleplay:owner><googleplay:email><![CDATA[snehvora@substack.com]]></googleplay:email><googleplay:author><![CDATA[Sneh Vora]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[Twitter Sentiment Analysis with Textblob]]></title><description><![CDATA[International Journal of Innovative Science and Research Technology &#183; Dec 3, 2022]]></description><link>https://www.snehvora.me/p/twitter-sentiment-analysis-with-textblob</link><guid isPermaLink="false">https://www.snehvora.me/p/twitter-sentiment-analysis-with-textblob</guid><dc:creator><![CDATA[Sneh Vora]]></dc:creator><pubDate>Sat, 06 Sep 2025 21:35:43 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/45eda83d-f0c6-4610-9cb2-81530c9b3410_1536x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h3>When I Taught Tweets to Speak: The Story Behind My Research</h3><p>It all started with a simple question that kept nagging at me: <em>What do people really feel when they post on Twitter?</em></p><p>In the whirlwind of digital conversations, Twitter has always fascinated me. Millions of people drop their thoughts every second&#8212;sometimes raw, sometimes witty, sometimes brutally honest. But beneath the chaos of hashtags, emojis, and abbreviations, I sensed a pattern. A hidden pulse.</p><p>And I wanted to capture it.</p><p>That curiosity became the seed of my research paper: <strong>&#8220;Twitter Sentiment Analysis using TextBlob&#8221;</strong> (<a href="https://ijisrt.com/assets/upload/files/IJISRT22NOV190_(1).pdf?utm_source=chatgpt.com">IJISRT, 2022</a>).</p><div><hr></div><h4>Why Twitter?</h4><p>When I began, I could have chosen Instagram, Facebook, or even Reddit. But Twitter stood out because of its brevity. Each tweet forces the user to compress their feelings into 280 characters. It&#8217;s like a stream of distilled human emotion&#8212;short, sharp, and surprisingly revealing.</p><p>For an ML engineer, that&#8217;s both a gift and a challenge.</p><p>The gift? Massive volumes of text-based interactions, perfect for analysis.<br>The challenge? Tweets are messy. Really messy.</p><div><hr></div><h4>The Early Struggles</h4><p>I still remember my first dataset. It was a jungle.</p><ul><li><p>Links to half-broken websites.</p></li><li><p>Random emojis.</p></li><li><p>Retweet markers.</p></li><li><p>And let&#8217;s not even talk about the spelling errors.</p></li></ul><p>Running my first scripts felt like staring into static on an old TV screen. I had data, sure&#8212;but no clarity.</p><p>That&#8217;s when I realized: before I could even think about machine learning, I had to get serious about <strong>cleaning</strong>.</p><p>I spent hours designing preprocessing steps: stripping URLs, normalizing text, removing stop words, and handling special characters. It felt less like data science and more like archaeology&#8212;scraping away dirt to reveal the artifact hidden beneath.</p><div><hr></div><h4>Building the Framework</h4><p>Once the noise was cleared, I turned to the heart of the project: <strong>sentiment analysis</strong>.</p><p>I didn&#8217;t start with deep neural networks or transformer models. Instead, I wanted to prove that <strong>a simple, accessible tool could still uncover powerful insights</strong>.</p><p>Enter <strong>TextBlob</strong>.</p><p>With its Pythonic simplicity, TextBlob let me classify tweets as <em>positive, negative,</em> or <em>neutral</em>. To some, it might seem too basic compared to today&#8217;s BERT or GPT-powered systems&#8212;but that was the beauty of it. The framework was lean, efficient, and approachable.</p><p>And soon enough, the results started pouring in.</p><div><hr></div><h4>What the Data Whispered</h4><p>The first time I visualized the sentiment distribution, it felt like watching a living heartbeat of the crowd. Suddenly, the noise had shape.</p><ul><li><p>I could see <strong>sentiment trends</strong> shifting around events.</p></li><li><p>Brands rising and falling in public favor.</p></li><li><p>Collective moods reacting in real-time to global happenings.</p></li></ul><p>This wasn&#8217;t just data&#8212;it was <strong>public opinion, quantified</strong>.</p><div><hr></div><h4>The Limitations I Faced</h4><p>Of course, I had my fair share of frustrations:</p><ul><li><p><strong>Language support</strong>: TextBlob only handled English. Every non-English tweet was a lost voice.</p></li><li><p><strong>Shallow classification</strong>: Some sarcasm or cultural nuance simply slipped through.</p></li><li><p><strong>Comparisons with advanced models</strong>: SVMs, LSTMs, and transformers promised higher accuracy, but I chose clarity and speed over complexity&#8212;for this paper at least.</p></li></ul><p>But every limitation also planted a seed for future work.</p><div><hr></div><h4>Lessons Learned</h4><p>Writing this paper wasn&#8217;t just about publishing&#8212;it was about learning.</p><p>I discovered that <strong>the hardest part of ML isn&#8217;t always the model&#8212;it&#8217;s the data.</strong><br>I learned how crucial it is to design with clarity, not just sophistication.<br>And most importantly, I realized that <strong>even simple approaches can have real impact</strong> when applied thoughtfully.</p><div><hr></div><h4>Where I&#8217;d Take It Next</h4><p>If I were to extend this research today, I&#8217;d explore:</p><ul><li><p><strong>Multilingual sentiment analysis</strong> to capture a truly global voice.</p></li><li><p><strong>Transformer-based models</strong> like BERT or RoBERTa for deeper contextual understanding.</p></li><li><p><strong>Real-time dashboards</strong> to let businesses visualize and act on sentiment as it unfolds.</p></li></ul><p>The journey started with TextBlob, but it certainly doesn&#8217;t end there.</p><div><hr></div><h4>Closing Thoughts</h4><p>Looking back, what began as a fascination with Twitter became a full-fledged research project that taught me more than I ever expected.</p><p>At its core, the paper was my attempt to decode the human voice&#8212;compressed into characters, hashtags, and emojis&#8212;and translate it into something organizations and individuals alike could understand.</p><p>And in that process, I realized something powerful: <strong>data doesn&#8217;t just tell us what happened. It tells us how we feel.</strong></p><p>That&#8217;s the story of my paper, and honestly, the story of why I fell in love with machine learning in the first place.</p>]]></content:encoded></item></channel></rss>