Skip to content
View kezouke's full-sized avatar
β˜•
working for a cold americano
β˜•
working for a cold americano

Block or report kezouke

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
kezouke/README.md

πŸ‘‹ Hi, I'm Elisey Smirnov. Coding since 2018 πŸ‘¨β€πŸ’»

ML Engineer (NLP) @ Innopolis University (GPA: 4.88)

Building intelligent systems at the intersection of NLP and production engineering

Python PyTorch LLM NLP

πŸš€ Work experience

⚑ LLM Generation Optimization

  • Boosted model accuracy by 90% via constrained decoding techniques
  • Productionized Hugging Face & VLLM pipelines
  • Open-Source Contribution: Added prompt tokens support to VLLM's Logits Processor (#4985)

πŸŽ“ Educational Course Automation

  • Built ETL pipeline: PDF parsing (Nougat) β†’ deduplication β†’ LLM enrichment
  • Automated theory generation & structural linking
  • Orchestrated 24-GPU cluster (6Γ—4xA100) with Prometheus/Grafana monitoring
  • Achieved 3x throughput using VLLM with dynamic load balancing

πŸ“š Git Documentation Toolkit

  • Reduced manual docs effort by 60% via auto-README generation
  • Developed context-aware summarization preserving documentation style

πŸ€– Multi-functional AI Agent on LLM

  • Architected LLM agent converting NLP queries to API call sequences (action planning)
  • Implemented hybrid argument extraction: slot filling + default values
  • Integrated with Google Calendar, Todoist, Gmail (+ extensible architecture)
  • Developed context preservation system improving planning accuracy by 40%

πŸ› οΈ Technical Arsenal

Languages & Frameworks
Python PyTorch Hugging Face Transformers VLLM REST API

ML Engineering
LLM Orchestration Constrained Decoding Multi-Agent Systems NLP Pipelines
Model Optimization Quantization Speculative Decoding

Infrastructure
Prometheus Grafana RabbitMQ MLflow Airflow Docker
Cluster Management Load Balancing GPU Optimization

πŸ” Key Competencies

  • LLM agent architecture design
  • Production inference optimization (↓latency/↑throughput)
  • Multi-modal pipeline development (PDF β†’ structured data)
  • Enterprise system integration
  • Full-cycle NLP solutions (POC β†’ MVP β†’ Scale)

πŸ“– About Me

  • Systems Thinker: Design solutions encompassing inference β†’ monitoring (e.g., 24-GPU cluster setup)
  • LLM Explorer: Implement cutting-edge techniques from AutoGPT/GPT-Engineer
  • Optimization Enthusiast: Experiment with quantization, speculative decoding

πŸŽ“ Education

BSc in Computer Science
Innopolis University (Expected 2026)

Certifications

  • Samsung IT School: Mobile Development (Java, Firebase)
  • Cyber-Physical Systems Development (Python, Cloud Tech)

⛷️ Beyond Code

  • Endurance kart racing (team competitions)
  • Skiing & snowboarding
  • Economics & marketing geek

GitHub Streak

πŸ“« Let's connect! LinkedIn | [email protected]

Pinned Loading

  1. IU-Capstone-Project-2024/SayNoMore IU-Capstone-Project-2024/SayNoMore Public

    The AI-Based Virtual Travel Assistant

    Python 3

  2. MRI-Diagnosis-API MRI-Diagnosis-API Public

    Deployment of ML model for brain tumor detection in an API and a web application that interacts with this API. The deployment utilizes Docker containers, FastAPI for the model API, and Streamlit fo…

    Python 1

  3. VAExperiment VAExperiment Public

    Simple VAE implementation exploring KL divergence and Wasserstein metric in the loss function

    Jupyter Notebook

  4. Numerical-Methods-DE Numerical-Methods-DE Public

    Numerical methods for solving first-order ordinary differential equations (ODEs) and systems of ODEs

    C++

  5. ForYourEyesOnlyyy/Sentiment_Analysis_for_Financial_News ForYourEyesOnlyyy/Sentiment_Analysis_for_Financial_News Public

    Building, experimenting, and deploying a machine learning-based solution for sentiment analysis on financial news articles

    Jupyter Notebook 1 1

  6. ReProcess ReProcess Public

    Tool designed for managing dependencies within code repositories

    Python 2