Cell to Set ✨

A powerful, user-friendly Streamlit web application that transforms Excel spreadsheets into clean JSON data and SQL queries in seconds.

📋 Table of Contents

Features
Demo
Installation
Usage
Configuration Options
Technical Details
API Reference
Contributing
License

✨ Features

Core Functionality

Excel File Upload: Support for both .xlsx and .xls file formats
Multi-Sheet Support: Automatically detects and allows selection from multiple sheets within a workbook
JSON Conversion: Convert Excel data to JSON with multiple orientation options
SQL Generation: Generate CREATE TABLE and INSERT INTO statements for MySQL, PostgreSQL, and SQLite

Data Cleaning

Intelligent Null Handling: Automatically detects and filters various null representations:
- NA, null, None
- nan, ns
- not available
- Empty strings and whitespace-only values
Column-Selective Cleaning: Choose specific columns for null filtering
Real-time Statistics: View retained rows, dropped rows, and retention rate

User Interface

Modern UI: Clean, professional interface with custom CSS styling
Data Preview: Preview raw and cleaned data before downloading
SQL Preview Tabs: View CREATE TABLE, INSERT INTO, and full SQL separately
Download Options: One-click download for both JSON and SQL outputs

🎬 Demo

Link to Project https://cell-to-set.streamlit.app/

Demo Video

![Watch the Demo]

Workflow

Upload your Excel file (.xlsx or .xls)
Select the sheet to convert (if multiple sheets exist)
Click "🚀 Start Process"
Review data overview and cleaning results
Configure JSON/SQL options in the sidebar
Download your converted JSON or SQL file

🚀 Installation

Prerequisites

Python 3.8 or higher
pip package manager

Setup

Clone the repository:

git clone https://github.com/MrImaginatory/Cell_to_Set.git
cd Excel_to_Json

Create a virtual environment (recommended):

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies:
```
pip install -r requirements.txt
```
Run the application:
```
streamlit run excelJson.py
```
Open your browser and navigate to http://localhost:8501

📖 Usage

Basic Usage

Upload File: Drag and drop or click to upload an Excel file
Select Sheet: If the workbook contains multiple sheets, select the desired one
Start Process: Click the "🚀 Start Process" button to begin conversion
Review Results: Check the data overview and cleaning statistics
Download: Use the download buttons to save your JSON or SQL output

Data Cleaning

The application automatically cleans your data by:

Converting various null representations to proper NA values
Removing rows with invalid data in selected columns
Preserving all valid data rows

JSON Output Options

Configure JSON output format in the sidebar:

Orientation	Description
`records`	List of dictionaries (default, most common)
`columns`	Dictionary with column names as keys
`index`	Dictionary with row indices as keys
`values`	Nested array of values
`table`	Table schema format

SQL Output Options

Option	Description
Table Name	Custom name for the generated SQL table
SQL Dialect	Choose between MySQL, PostgreSQL, or SQLite

⚙️ Configuration Options

Sidebar Settings

JSON Options

Orientation: Select the JSON structure format
Indent Level: Control formatting indentation (0-4 spaces)

SQL Options

Table Name: Specify the target table name (default: my_table)
SQL Dialect: Choose target database:
- MySQL
- PostgreSQL
- SQLite

🔧 Technical Details

Dependencies

Package	Purpose
`streamlit`	Web application framework
`pandas`	Data manipulation and Excel reading
`openpyxl`	Excel file format support

Data Type Mapping (SQL)

Pandas dtype	MySQL	PostgreSQL	SQLite
`int*`	INT	INTEGER	INT
`float*`	DOUBLE	DOUBLE PRECISION	REAL
`bool`	BOOLEAN	BOOLEAN	BOOLEAN
`datetime`	DATETIME	TIMESTAMP	DATETIME
`object/string`	VARCHAR(255)	TEXT	TEXT

Caching

The application uses Streamlit's @st.cache_data decorator for:

get_sheet_names() - Caches sheet names for performance
load_data() - Caches loaded DataFrames

📚 API Reference

Functions

`get_sheet_names(file)`

Retrieves all sheet names from an Excel file.

Parameters:

file: Uploaded Excel file object

Returns:

list: List of sheet names or None on error

`load_data(file, sheet_name=0)`

Loads a specific sheet from an Excel file into a pandas DataFrame.

Parameters:

file: Uploaded Excel file object
sheet_name: Sheet name or index (default: 0)

Returns:

DataFrame: Loaded data or None on error

`clean_data(df, selected_cols)`

Cleans a DataFrame by converting null-like values and dropping invalid rows.

Parameters:

df: Input DataFrame
selected_cols: List of columns to clean

Returns:

DataFrame: Cleaned DataFrame

Null Values Detected:

NA, null, None (case insensitive)
nan, ns (case insensitive)
not available (case insensitive)
Empty strings and whitespace

`sanitize_name(name)`

Sanitizes a column or table name for SQL compatibility.

Parameters:

name: Original name string

Returns:

str: Sanitized, lowercase name with underscores

`map_dtype_to_sql(dtype, dialect='mysql')`

Maps pandas data types to SQL column types.

Parameters:

dtype: pandas dtype object
dialect: Target SQL dialect ('mysql', 'postgresql', 'sqlite')

Returns:

str: SQL column type string

`generate_sql(df, table_name, dialect='mysql')`

Generates CREATE TABLE and INSERT INTO SQL statements.

Parameters:

df: Input DataFrame
table_name: Target table name
dialect: SQL dialect for type mapping

Returns:

tuple: (create_statement, insert_statements)

🎨 UI Components

Page Configuration

Page Title: "Cell to Set"
Page Icon: ✨
Layout: Wide
Initial Sidebar State: Expanded

Custom Styling

The application includes custom CSS for:

Header styling (.main-header, .sub-header)
Button hover effects
Benefit cards for the landing page
Metric value formatting

📁 Project Structure

Excel_to_Json/
├── excelJson.py        # Main application file
├── requirements.txt    # Python dependencies
├── README.md           # Documentation
└── .gitignore          # Git ignore rules

🤝 Contributing

Contributions are welcome! Please follow these steps:

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

Code Style

Follow PEP 8 guidelines
Add docstrings to all functions
Include type hints where appropriate

📝 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

Built with Streamlit
Data processing powered by pandas
Excel file support via openpyxl

📧 Support

For issues, questions, or suggestions, please open an issue in the repository.

Made with ❤️ for efficiency.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
DEMO_VIDEO.webm		DEMO_VIDEO.webm
README.md		README.md
excelJson.py		excelJson.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Cell to Set ✨

📋 Table of Contents

✨ Features

Core Functionality

Data Cleaning

User Interface

🎬 Demo

Link to Project https://cell-to-set.streamlit.app/

Demo Video

Workflow

🚀 Installation

Prerequisites

Setup

📖 Usage

Basic Usage

Data Cleaning

JSON Output Options

SQL Output Options

⚙️ Configuration Options

Sidebar Settings

JSON Options

SQL Options

🔧 Technical Details

Dependencies

Data Type Mapping (SQL)

Caching

📚 API Reference

Functions

get_sheet_names(file)

load_data(file, sheet_name=0)

clean_data(df, selected_cols)

sanitize_name(name)

map_dtype_to_sql(dtype, dialect='mysql')

generate_sql(df, table_name, dialect='mysql')

🎨 UI Components

Page Configuration

Custom Styling

📁 Project Structure

🤝 Contributing

Code Style

📝 License

🙏 Acknowledgments

📧 Support

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`get_sheet_names(file)`

`load_data(file, sheet_name=0)`

`clean_data(df, selected_cols)`

`sanitize_name(name)`

`map_dtype_to_sql(dtype, dialect='mysql')`

`generate_sql(df, table_name, dialect='mysql')`

Packages