KaranSinghDev/Comprehensive-Text-Analysis-Pipeline-Automated-Sentiment-and-Readability-Metrics-Extraction

Sentiment Analysis and Text Insights Engine

Overview

This project is a sentiment analysis and text insights engine that extracts, preprocesses, and analyzes text from web articles. It computes sentiment scores, readability metrics such as the Fog Index and complex word count, and linguistic features such as personal pronoun frequency. The goal is to provide actionable insights into article content for applications such as content summarization, trend analysis, and sentiment tracking.

Key Features

  • Sentiment Analysis: Computes positive, negative, and polarity scores using VADER SentimentIntensityAnalyzer and TextBlob.
  • Text Readability: Calculates readability metrics like Fog Index, average sentence length, and percentage of complex words.
  • Linguistic Features: Extracts features such as personal pronoun counts, syllables per word, and average word length.
  • Web Scraping: Scrapes articles from the web using BeautifulSoup and requests.
  • Data Export: Saves processed results into an Excel file for easy analysis and downstream applications.
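
As a rough illustration of the readability and linguistic metrics listed above, here is a minimal sketch using only the Python standard library. It is not the project's actual code. It assumes the standard Gunning Fog formula, 0.4 × (average sentence length + percentage of complex words), treats a word as complex when it has three or more syllables, and estimates syllables by counting vowel groups. The function names, the pronoun list, and the dictionary keys are all hypothetical.

```python
import re

# Hypothetical pronoun list; the project may use a different one.
PRONOUN_RE = re.compile(r"\b(?:I|we|my|ours|us)\b", re.IGNORECASE)


def count_syllables(word):
    """Estimate syllables by counting vowel groups (a rough heuristic)."""
    groups = re.findall(r"[aeiouy]+", word.lower())
    return max(len(groups), 1)


def readability_metrics(text):
    """Return readability and linguistic metrics for a block of text."""
    # Split on sentence-ending punctuation and drop empty fragments.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z]+", text)
    if not sentences or not words:
        return {}

    syllables = [count_syllables(w) for w in words]
    # Assumption: "complex" means three or more estimated syllables.
    complex_count = sum(1 for s in syllables if s >= 3)

    avg_sentence_length = len(words) / len(sentences)
    pct_complex = 100 * complex_count / len(words)

    # Skip the all-caps country abbreviation "US" when counting pronouns.
    pronouns = [p for p in PRONOUN_RE.findall(text) if p != "US"]

    return {
        "word_count": len(words),
        "complex_word_count": complex_count,
        "avg_sentence_length": avg_sentence_length,
        "pct_complex_words": pct_complex,
        "fog_index": 0.4 * (avg_sentence_length + pct_complex),
        "syllables_per_word": sum(syllables) / len(words),
        "avg_word_length": sum(len(w) for w in words) / len(words),
        "personal_pronouns": len(pronouns),
    }


if __name__ == "__main__":
    sample = "We love simple text. Our beautiful analysis is ready!"
    for name, value in readability_metrics(sample).items():
        print(f"{name}: {value}")
```

The vowel-group heuristic miscounts some words (silent "e" endings, for example), so a dictionary-based syllable counter may be preferable where accuracy matters.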
