John Rey Vilbar | Zach Jacob Feldan | Ruby Jane Facurib
ITD105 Big Data Analytics
Soundi.py is a web app designed for doing simple data visualizations of Spotify datasets. This application focuses on providing insights into the music details contained in the database, offering valuable information on the number of songs released each year and highlighting the top songs of each artist.
- Python Eel
- Pandas
- sklearn
- Tkinter
This project uses the Kaggle dataset by Vatsal Mavani, which can be found here.
- Setup a virtual environment (Python version used: 3.11.0):
python -m venv env
- Install the needed dependencies:
pip install -r requirements.txt
- Use the datasets (especially
data.csv
anddata_w_genres.csv
) provided in the Kaggle link above - Before loading to the web app, clean the csv file first busing the
clean.py
file - Execute the
app.py
file
- Upon launching the application, you will be prompted to upload your dataset.
- Select your file and click the "Analyze" button.
- Once your dataset is successfully loaded, the application will automatically initiate the analysis process, presenting an overview of the dataset selected.
- Another feature of the application is the "artist profile", wherein you can look for the detailed information of a specific artist. To do this, click the "Artist Profile" button on the menu bar.
- Dataset Upload: Users are prompted to upload their dataset upon launching the application.
- File Compatibility: The application recommends uploading a CSV file for optimal compatibility.
- Automatic Analysis: Once the dataset is successfully loaded, the application automatically initiates the analysis process.
- Dataset Overview: Presents a comprehensive overview of the dataset information extracted from the uploaded CSV file.
- Yearly Song Analysis: Generates a graph illustrating the number of songs released each year based on the dataset.
- Musical Key Share: Provides insights into the distribution of musical keys in the analyzed dataset.
- Artist Profile: Navigating to the "Artist Profile" section allows users to explore both "Overview" and "Artist Profile" options.
- Top 10 Songs Dashboard: Users can search for a specific artist and view a detailed dashboard showcasing the artist's top 10 songs over the years.
- Popularity Rating: Displays the popularity rating of the selected artist.
- Number of Followers: Presents information on the total number of followers for the selected artist.
- Music Genre: Indicates the predominant music genre associated with the selected artist.
- Average Tempo: Provides the average tempo of the released tracks by the selected artist.
- Catalog Information: Offers a catalog section showing the total tracks recorded in the dataset.
- Explicit Percentage: Displays the percentage of explicit tracks within the dataset for the selected artist.
- Average Tempo across the years: Generates a graph illustrating the average tempo of the selected artist's songs over the years.
- Average Loudness: Presents the average loudness of the songs over the years.
- Acousticness: Displays the acousticness of the songs over the years.
- Danceability: Presents the danceability or how suitable songs are for dancing over the years.
Feel free to explore the diverse functionalities of Soundi.py to enhance your understanding of Spotify dataset analytics.