🚀 Transformer Architecture

This repository contains a modular, object-oriented implementation of a Transformer model built from scratch in NumPy, without PyTorch or TensorFlow.
It also includes training notebooks, Word2Vec-based embeddings, and utilities for low-level neuron analysis and debugging.

🔶 Components Overview

1. `src/transformer.py`

Implements the full Transformer architecture described in the paper "Attention Is All You Need".

✔ Core Modules

Self-Attention
Scaled Dot-Product Attention
Feed-Forward Networks (FFN)
Residual Connections + LayerNorm
Positional Encoding
Encoder Layer
Decoder Layer
Masked (causal) attention for decoding
Cross-attention from the decoder to the encoder output
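
As an illustration of the attention modules listed above, here is a minimal NumPy sketch of scaled dot-product attention with an optional causal mask. The function names (`softmax`, `causal_mask`, `scaled_dot_product_attention`) and the tensor layout `(batch, seq_len, d_model)` are assumptions made for this example and are not taken from `src/transformer.py`.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the max for numerical stability before exponentiating.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def causal_mask(seq_len):
    # True where attention is allowed: key position <= query position.
    return np.tril(np.ones((seq_len, seq_len), dtype=bool))

def scaled_dot_product_attention(q, k, v, mask=None):
    # q, k, v: (batch, seq_len, d_k). Hypothetical layout for this sketch.
    d_k = q.shape[-1]
    scores = q @ np.swapaxes(k, -1, -2) / np.sqrt(d_k)
    if mask is not None:
        # Disallowed positions get a large negative score, so softmax gives ~0.
        scores = np.where(mask, scores, -1e9)
    weights = softmax(scores, axis=-1)
    return weights @ v, weights

rng = np.random.default_rng(0)
x = rng.normal(size=(2, 4, 8))  # (batch, seq_len, d_model)
out, weights = scaled_dot_product_attention(x, x, x, mask=causal_mask(4))
```

Passing the same array as `q`, `k`, and `v` gives self-attention; for cross-attention, `q` would come from the decoder and `k`, `v` from the encoder output, with no causal mask.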

Supports:

Batching
Sequence-level attention
Word2Vec embeddings as token vectors

2. `src/MPNeuronInfo.py`

Contains fundamental neural components implemented from scratch:

✔ Layers

Layer_Dense
Activation_ReLU
Activation_Softmax
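
To show roughly what such layers involve, here is a small sketch of a dense layer with forward and backward passes, plus ReLU and softmax activations. The names `Dense`, `relu`, and `softmax` and the initialization scale are assumptions for illustration; the actual `Layer_Dense`, `Activation_ReLU`, and `Activation_Softmax` classes may be organized differently.

```python
import numpy as np

class Dense:
    """Hypothetical fully connected layer: y = x @ W + b."""

    def __init__(self, n_in, n_out, rng):
        self.W = 0.01 * rng.normal(size=(n_in, n_out))
        self.b = np.zeros((1, n_out))

    def forward(self, x):
        self.x = x  # cache the input for the backward pass
        return x @ self.W + self.b

    def backward(self, grad_out):
        # Gradients with respect to the parameters and the input.
        self.dW = self.x.T @ grad_out
        self.db = grad_out.sum(axis=0, keepdims=True)
        return grad_out @ self.W.T

def relu(x):
    return np.maximum(0.0, x)

def softmax(x):
    # Row-wise softmax with max subtraction for numerical stability.
    shifted = x - x.max(axis=1, keepdims=True)
    e = np.exp(shifted)
    return e / e.sum(axis=1, keepdims=True)

rng = np.random.default_rng(0)
layer1, layer2 = Dense(4, 16, rng), Dense(16, 3, rng)
x = rng.normal(size=(5, 4))  # 5 samples, 4 features
probs = softmax(layer2.forward(relu(layer1.forward(x))))
```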

✔ Loss Function

Loss_CrossCategoricalEntropy
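
For reference, categorical cross-entropy over integer class labels can be sketched as follows. The function names and the `eps` clipping value are assumptions for this example, not the interface of `Loss_CrossCategoricalEntropy`.

```python
import numpy as np

def categorical_cross_entropy(probs, targets, eps=1e-7):
    # probs: (n_samples, n_classes) predicted probabilities.
    # targets: (n_samples,) integer class indices.
    probs = np.clip(probs, eps, 1 - eps)  # avoid log(0)
    picked = probs[np.arange(len(targets)), targets]
    return -np.log(picked).mean()

def softmax_cross_entropy_grad(probs, targets):
    # Gradient of the mean loss with respect to the pre-softmax logits,
    # valid when probs came from a softmax: (probs - one_hot) / n_samples.
    grad = probs.copy()
    grad[np.arange(len(targets)), targets] -= 1.0
    return grad / len(targets)

probs = np.array([[0.7, 0.2, 0.1],
                  [0.1, 0.8, 0.1]])
targets = np.array([0, 1])
loss = categorical_cross_entropy(probs, targets)
grad = softmax_cross_entropy_grad(probs, targets)
```

Combining softmax and cross-entropy in one gradient expression is a common simplification; whether the repository does this is not stated here.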

✔ Optimizer

OptimizerAdam (with momentum, RMSProp-style second-moment scaling, and bias correction)

These mirror the internals of deep learning libraries but are written by hand for transparency.
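
The three ingredients of Adam named above (momentum, second-moment scaling, bias correction) can be seen in a short sketch. The class name `Adam`, the dict-based parameter interface, and the hyperparameter defaults are assumptions for illustration and may differ from `OptimizerAdam`.

```python
import numpy as np

class Adam:
    """Hypothetical Adam optimizer over a dict of NumPy parameter arrays."""

    def __init__(self, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
        self.lr, self.beta1, self.beta2, self.eps = lr, beta1, beta2, eps
        self.t = 0   # step counter used for bias correction
        self.m = {}  # first-moment (momentum) estimates
        self.v = {}  # second-moment (squared gradient) estimates

    def step(self, params, grads):
        self.t += 1
        for name, g in grads.items():
            m = self.m.get(name, np.zeros_like(g))
            v = self.v.get(name, np.zeros_like(g))
            m = self.beta1 * m + (1 - self.beta1) * g
            v = self.beta2 * v + (1 - self.beta2) * g * g
            self.m[name], self.v[name] = m, v
            # Bias correction compensates for the zero initialization.
            m_hat = m / (1 - self.beta1 ** self.t)
            v_hat = v / (1 - self.beta2 ** self.t)
            params[name] -= self.lr * m_hat / (np.sqrt(v_hat) + self.eps)

# Minimize f(w) = (w - 3)^2 as a toy check.
params = {"w": np.array([5.0])}
opt = Adam(lr=0.1)
for _ in range(500):
    grads = {"w": 2 * (params["w"] - 3.0)}
    opt.step(params, grads)
```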

3. Tokenization & Embeddings

The project uses:

nltk.word_tokenize for tokenization
gensim.Word2Vec for dense vector embeddings

Workflow:

Tokenize English/Spanish sentences
Convert tokens → vectors via Word2Vec
Pass sequence embeddings → Transformer

4. Training Notebook (`notebooks/transformer_training.ipynb`)

Shows the complete flow:

✔ Data Preprocessing

Tokenization
Vocabulary mapping
Embedding lookup
Padding & batching
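
The padding and batching step can be sketched as below. This is a minimal example under stated assumptions: a plain dict of random vectors stands in for the Word2Vec lookup, the `<EOS>` token and the `pad_batch` helper are hypothetical, and padding positions are filled with zeros and tracked in a boolean mask.

```python
import numpy as np

def pad_batch(sequences, d_model):
    # sequences: list of arrays shaped (seq_len_i, d_model).
    max_len = max(len(s) for s in sequences)
    batch = np.zeros((len(sequences), max_len, d_model))
    mask = np.zeros((len(sequences), max_len), dtype=bool)
    for i, seq in enumerate(sequences):
        batch[i, :len(seq)] = seq
        mask[i, :len(seq)] = True  # True marks real (non-padding) tokens
    return batch, mask

# Hypothetical lookup table standing in for trained Word2Vec vectors.
rng = np.random.default_rng(0)
vocab = ["<SOS>", "<EOS>", "the", "cat", "sat"]
embeddings = {tok: rng.normal(size=8) for tok in vocab}

sentences = [["<SOS>", "the", "cat", "sat", "<EOS>"],
             ["<SOS>", "cat", "<EOS>"]]
vectors = [np.stack([embeddings[t] for t in s]) for s in sentences]
batch, mask = pad_batch(vectors, d_model=8)
```

The mask can later be used to keep attention from looking at padding positions and to exclude them from the loss.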

✔ Training Loop

Forward pass
Loss computation
Backpropagation
Parameter updates (Adam)
Logging loss curves
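
The five steps above can be seen end to end in a deliberately small sketch. It trains a single softmax layer on synthetic data rather than the Transformer, so the data, shapes, and hyperparameters are all assumptions chosen to keep the example short.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic 3-class data that a linear model can separate.
X = rng.normal(size=(90, 4))
true_W = rng.normal(size=(4, 3))
y = (X @ true_W).argmax(axis=1)

W = np.zeros((4, 3))
m, v = np.zeros_like(W), np.zeros_like(W)  # Adam moment estimates
lr, b1, b2, eps = 0.05, 0.9, 0.999, 1e-8
loss_history = []
rows = np.arange(len(y))

for step in range(1, 201):
    # Forward pass: logits -> softmax probabilities.
    logits = X @ W
    logits = logits - logits.max(axis=1, keepdims=True)
    probs = np.exp(logits)
    probs = probs / probs.sum(axis=1, keepdims=True)

    # Loss computation: mean categorical cross-entropy.
    loss = -np.log(probs[rows, y] + 1e-12).mean()

    # Backpropagation: gradient of the loss with respect to W.
    grad_logits = probs.copy()
    grad_logits[rows, y] -= 1.0
    grad_W = X.T @ grad_logits / len(y)

    # Parameter update (Adam with bias correction).
    m = b1 * m + (1 - b1) * grad_W
    v = b2 * v + (1 - b2) * grad_W ** 2
    m_hat = m / (1 - b1 ** step)
    v_hat = v / (1 - b2 ** step)
    W -= lr * m_hat / (np.sqrt(v_hat) + eps)

    # Logging: keep the loss so it can be plotted later.
    loss_history.append(loss)
```

Plotting `loss_history` against the step index gives the loss curve.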

✔ Inference Logic

Start with the <SOS> token
Decode autoregressively, one token per step
Add positional encodings at each step
Reuse the encoder output across all decoding steps
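
The inference steps above can be sketched as a greedy decoding loop. Everything here is an assumption for illustration: `greedy_decode`, the `decoder_step` callable, the `<EOS>` stopping rule, and the toy decoder that stands in for a trained model. The positional encoding uses the sinusoidal formula from the paper.

```python
import numpy as np

def positional_encoding(seq_len, d_model):
    # Sinusoidal encoding: sin on even dimensions, cos on odd dimensions.
    pos = np.arange(seq_len)[:, None]
    i = np.arange(d_model)[None, :]
    angle = pos / np.power(10000.0, (2 * (i // 2)) / d_model)
    pe = np.zeros((seq_len, d_model))
    pe[:, 0::2] = np.sin(angle[:, 0::2])
    pe[:, 1::2] = np.cos(angle[:, 1::2])
    return pe

def greedy_decode(encoder_out, decoder_step, embed, sos_id, eos_id, max_len):
    tokens = [sos_id]  # start with the <SOS> token
    for _ in range(max_len):
        # Re-embed the tokens so far and add positional encodings each step.
        x = embed[tokens] + positional_encoding(len(tokens), embed.shape[1])
        # The same encoder output is reused at every step.
        logits = decoder_step(x, encoder_out)  # (len(tokens), vocab_size)
        next_id = int(logits[-1].argmax())
        tokens.append(next_id)
        if next_id == eos_id:
            break
    return tokens

def toy_decoder_step(x, encoder_out):
    # Stand-in for a trained decoder: position p predicts token id p + 1.
    vocab_size = 5
    n = x.shape[0]
    logits = np.zeros((n, vocab_size))
    logits[np.arange(n), np.minimum(np.arange(n) + 1, vocab_size - 1)] = 1.0
    return logits

rng = np.random.default_rng(0)
embed = rng.normal(size=(5, 8))        # hypothetical embedding table
encoder_out = rng.normal(size=(6, 8))  # computed once, before decoding
decoded = greedy_decode(encoder_out, toy_decoder_step, embed,
                        sos_id=0, eos_id=4, max_len=10)
```

Greedy decoding picks the highest-scoring token at each step; other strategies such as beam search would replace only the `argmax` line.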

🛠 Setup

python -m venv .venv
source .venv/bin/activate     # On Windows: .venv\Scripts\activate
pip install -r requirements.txt
