Hello, I'm Bibek Dhungana

Transforming bugs into features and data into insights.

Problem solver with a strong background in mathematics, physics, and coding.
I enjoy using data, ML, and AI to build practical systems that solve real problems.


Projects

A few things I've built that I'm proud of.

ResearchTeam

GitHub | Full Stack | ML / AI

An open-source full-stack platform that unifies grant discovery and researcher matching across a corpus of 80,000+ U.S. federal grants. Its semantic retrieval engine (Cohere embed-v4 embeddings on S3 Vectors) outperformed BM25, TF-IDF, and keyword baselines (MRR 0.74 vs. 0.54; nDCG@5 +47%) over 500 researcher profiles. Grants.gov ingestion is automated via AWS Glue ETL, and the platform is deployed serverless on Lambda with Cognito auth and GitHub Actions CI/CD.

Next.js | FastAPI | AWS Aurora | Amazon Bedrock | AWS Glue | +2 more
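For readers unfamiliar with the retrieval metrics quoted above, here is a minimal sketch of how MRR and binary-relevance nDCG@5 are commonly computed. It is not the project's evaluation code; the function names, grant IDs, and relevance sets are hypothetical and chosen only for illustration.

```python
import math

def mean_reciprocal_rank(ranked_lists, relevant_sets):
    """Average of 1/rank of the first relevant item per query (0 if none appears)."""
    total = 0.0
    for ranked, relevant in zip(ranked_lists, relevant_sets):
        for rank, item in enumerate(ranked, start=1):
            if item in relevant:
                total += 1.0 / rank
                break
    return total / len(ranked_lists)

def ndcg_at_k(ranked, relevant, k=5):
    """Binary-relevance nDCG@k: DCG of the ranking divided by the ideal DCG."""
    dcg = sum(
        1.0 / math.log2(i + 1)
        for i, item in enumerate(ranked[:k], start=1)
        if item in relevant
    )
    ideal_hits = min(len(relevant), k)
    idcg = sum(1.0 / math.log2(i + 1) for i in range(1, ideal_hits + 1))
    return dcg / idcg if idcg > 0 else 0.0

# Hypothetical rankings for two researcher profiles (grant IDs are made up).
rankings = [["g3", "g1", "g7"], ["g2", "g9", "g4"]]
relevant = [{"g1"}, {"g2", "g4"}]

mrr = mean_reciprocal_rank(rankings, relevant)
ndcg = ndcg_at_k(rankings[1], relevant[1], k=5)
```

In this toy case the first relevant grant sits at rank 2 for the first profile and rank 1 for the second, so the MRR is the mean of 1/2 and 1.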

StreamCab — Real-Time Taxi Fare Intelligence

GitHub | Data | ML / AI

A containerized Kafka + Spark Structured Streaming pipeline (Docker Compose) that ingests NYC TLC taxi trips and computes 10-minute zone aggregates in PostgreSQL. A memory-aware XGBoost fare predictor (continued boosting over batched parquet) reaches 3.42% MAPE — a 78% error reduction over the zone-hour-mean baseline (MAE $0.49 vs. $1.99) — surfaced in a live analytics dashboard.

Python | Apache Kafka | Spark Structured Streaming | XGBoost | PostgreSQL | +1 more
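The error metrics quoted above (MAE, MAPE, and reduction relative to a baseline) can be sketched as follows. This is an illustrative example under stated assumptions, not the project's code: the helper names and the fare values are hypothetical, and the real pipeline may compute these differently.

```python
def mae(actual, predicted):
    """Mean absolute error, in the same units as the target (dollars here)."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)

def mape(actual, predicted):
    """Mean absolute percentage error; assumes no actual value is zero."""
    return 100.0 * sum(abs(a - p) / abs(a) for a, p in zip(actual, predicted)) / len(actual)

def relative_reduction(baseline_error, model_error):
    """Percentage by which the model's error is lower than the baseline's."""
    return 100.0 * (baseline_error - model_error) / baseline_error

# Hypothetical fares in dollars (made up for illustration).
actual = [10.0, 20.0, 40.0]
model_pred = [11.0, 19.0, 42.0]
baseline_pred = [15.0, 15.0, 30.0]  # stands in for a zone-hour-mean style baseline

model_mae = mae(actual, model_pred)
baseline_mae = mae(actual, baseline_pred)
model_mape = mape(actual, model_pred)
reduction = relative_reduction(baseline_mae, model_mae)
```

MAE reports error in dollars while MAPE normalizes by the fare, so the two can rank models differently when fares vary widely.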

RAG: The Philosophical Computer

GitHub | ML / AI

A privacy-first, fully local RAG system that ingests academic PDFs into a semantic vector store and synthesizes grounded outputs via a locally served LLM (Llama 3 / gpt-oss 20B). Built with LangChain, ChromaDB, and HuggingFace MiniLM embeddings on Ollama, it eliminates external API calls and cloud costs entirely.

Python | LangChain | ChromaDB | HuggingFace MiniLM | Ollama | +1 more

Apparel Classification System

GitHub | ML / AI | Full Stack

A Fashion-MNIST clothing classifier built with softmax and One-vs-Rest logistic regression over PCA features, reaching ~85% test accuracy while PCA cuts training time ~3.5x; rotation robustness was analyzed as part of the evaluation. Served end-to-end via FastAPI + React with real-time camera inference and background removal.

Python | scikit-learn | PCA | FastAPI | React | +1 more

Quantum Adaptive Self-Attention (QASA)

GitHub | ML / AI

A hybrid quantum-classical Transformer that replaces the final encoder block's FFN with a parameterized quantum circuit (data re-uploading + entanglement) in PyTorch and PennyLane, trained end-to-end via the parameter-shift rule. It matched a classical Transformer baseline (R² 0.88 vs. 0.90) with ~6% fewer parameters and retained R² 0.86 under IBM Quantum hardware-noise evaluation.

Python | PyTorch | PennyLane | Quantum Computing | Transformers | +1 more
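As a rough illustration of the parameter-shift rule mentioned above, the sketch below differentiates a single-qubit expectation value using two shifted evaluations. It is a toy, not the QASA circuit: the one-gate "circuit" is simulated analytically (the expectation of Z after RX(theta) on |0> is cos(theta)), and the function names are hypothetical.

```python
import math

def expectation(theta):
    """<Z> after applying RX(theta) to |0>; analytically equal to cos(theta)."""
    return math.cos(theta)

def parameter_shift_gradient(f, theta):
    """Gradient from two evaluations shifted by +/- pi/2.

    This form is exact for gates generated by a Pauli operator (such as RX);
    it is not a finite-difference approximation.
    """
    shift = math.pi / 2
    return (f(theta + shift) - f(theta - shift)) / 2.0

theta = 0.7
grad = parameter_shift_gradient(expectation, theta)
analytic = -math.sin(theta)  # derivative of cos(theta), for comparison
```

On hardware, each shifted evaluation would be an estimated expectation value from repeated circuit runs, which is why the rule suits training on real devices where backpropagation through the circuit is unavailable.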

Education

The foundation behind everything I build.

Vanderbilt University

M.S. & Ph.D. in Computer Science (Artificial Intelligence & Machine Learning)

August 2025 - Present
Nashville, Tennessee, USA
GPA: 4.0/4.0 | Graduate Research Assistant | Machine Learning, Artificial Intelligence

Key Courses

Numerical Analysis
Machine Learning
Artificial Intelligence
Deep Learning
Quantum Computing
Big Data

Texas Tech University

Bachelor of Science in Computer Science, Minor in Mathematics

Jan 2019 - May 2023
Lubbock, Texas, USA
GPA: 3.9/4.0 | Summa Cum Laude

Key Courses

Data Structures & Algorithms
Database Management Systems
Computer Networks
Software Engineering
Operating Systems
Object-Oriented Programming
Computer Architecture
Discrete Mathematics
Theory of Computation
Automata

Experience

Places I've worked and problems I've solved.

August 2025 - Present

Teaching & Research Assistant

Vanderbilt University, SOL Lab

Nashville, Tennessee
Applied Research | LLMs | Information Retrieval | Teaching
Python | FastAPI | LLMs | Semantic Retrieval | AWS

Role Description

  • Architecting research.team, an LLM-powered platform that unifies grant discovery and team formation via semantic retrieval over 80,000+ federal grant opportunities, grounded in a mixed-methods HCI study (N=36 interviews, N=306 survey) and a retrieval benchmark against BM25/TF-IDF baselines.
  • Teaching Assistant for Algorithms (CS 3250), supporting 60+ students through weekly office hours, exam review sessions, and grading on core data structures and complexity concepts.

July 2024 - July 2025

Software Engineer

St. Jude Children's Research Hospital – ALSAC

Memphis, Tennessee
Healthcare Tech | Microservices | Real-time Data Streaming
AWS Glue | Apache Kafka | MuleSoft | SQL Server | .NET | React | OpenShift | Docker | Kubernetes | Jenkins

June 2023 - June 2024

Software Engineer

AGCO (Precision Planting)

Bloomington, Illinois
AgTech | Data Catalog | Cloud Migration | API Development
Python | Node.js | AWS Aurora | DynamoDB | S3 | Express.js | Puppeteer | GitHub Actions

June 2022 - May 2023

Software Engineer Intern

Heavy Construction Systems Specialists (HCSS)

Houston, Texas
Construction Tech | API Development | CI/CD Pipelines | Developer Tools
.NET | React | Azure | PostgreSQL | Docker | Azure DevOps

January 2021 - January 2022

Research Assistant

Mobility Automation Lab, Texas Tech University

Lubbock, Texas
Robotics | Computer Vision | SLAM | Autonomous Systems
Python | ROS | OpenCV | AWS | EC2 | S3 | NVIDIA Jetson

May 2021 - August 2021

Software Engineering Intern

NSF I-Corps Program

Lubbock, Texas
Research Commercialization | Full Stack Development | Customer Discovery | UI/UX Design
React | Spring Boot | MySQL | AWS | OAuth2

Skills & Expertise

What I work with day to day.

5 Domains | 66 Tools | 27 Expert-level

Expert (4)
PyTorch | Hugging Face | scikit-learn | RAG

Proficient (10)
LLMs | Generative AI | LangGraph | MLflow | Feature Engineering | Model Deployment | Transformers | NLP | Experiment Tracking | Computer Vision

Skill levels: Expert | Proficient | Familiar

About Me

The short version.

I'm a passionate developer who transforms ideas into robust, scalable solutions through clean code and data-driven decisions. Whether it's designing modern frontends, engineering performant backends, or extracting insights from data, I bring full-stack versatility with a problem-solving mindset.

My passion also extends to data science: cleaning data, building ML models, and using Python tools like Pandas, scikit-learn, and PyTorch to turn raw data into sound decisions. I love using data to surface actionable insights, visualize trends, and solve real-world problems.

Outside of work, I enjoy tennis and exploring the latest in AI, DevOps, and systems design. My mission? To build secure, intelligent, and impactful software that scales.

I thrive in collaborative environments where problem-solving, creativity, and curiosity come together. Let's connect and build something extraordinary!

Get In Touch

Have something in mind? Let's talk.

832-310-6869 | Nashville, Tennessee