Thura Win Kyaw

AI/ML/Electrical Engineer

Building AI systems that reason over knowledge.

python · rag · langchain · nlp · RL · vector-db · ollama

Thura Win Kyaw

I'm Thura Win Kyaw, an AI/ML Engineer based in Daejeon, South Korea. I design and build RAG pipelines, LLM applications, and conversational AI systems — currently developing intelligent biomedical knowledge retrieval at AIdenBio. My work sits at the intersection of large language models, retrieval systems, and real-world software, turning research ideas into production-grade AI applications.

I specialize in Retrieval-Augmented Generation (RAG), LangChain-based pipelines, and LLM-powered conversational systems. I enjoy designing end-to-end AI workflows — from vector search and prompt engineering to FastAPI backends and local model deployment with Ollama.

Alongside my engineering work, I'm completing a Bachelor's degree in Electrical Engineering at Chungnam National University, where I explore the overlap between control systems, signal processing, and intelligent automation. I hold HarvardX certifications in Computer Science, Python, and AI.

Based in Daejeon, South Korea
Currently AI Engineer at AIdenBio
Studying B.E. Electrical Engineering, CNU
Focus RAG · LLMs · Biomedical AI
Open to Research roles & internships

Research Interests

Retrieval-Augmented Generation NLP, Biomedical NLP Reinforcement Learning (RL) In-context learning (ICL) Prompt Engineering LLM Evaluation Conversational AI Systems Local LLM Deployment

Iris Flower Classification

Python · SVM · Scikit-learn · PCA

Supervised SVM classifier with StandardScaler preprocessing and GridSearchCV hyperparameter tuning. Evaluated via accuracy, confusion matrix, and F1 score. Decision boundaries visualized with PCA; best model pipeline serialized for reuse.

Customer Segmentation

Python · K-means · Scikit-learn

Unsupervised K-means clustering on customer demographic data to identify actionable segments. Elbow method and silhouette analysis for optimal k selection. Trained model and scaler serialized for downstream deployment.

Text Classification

Python · NLP · TF-IDF · SVM · Logistic Regression

Multi-model text classifier with TF-IDF features across Naive Bayes, Logistic Regression, and SVM with GridSearchCV tuning. Evaluated with precision, recall, and F1. Model, vectorizer, and label encoder saved for deployment.

Quote Sentiment Analysis

Python · NLP · VADER · BeautifulSoup

End-to-end sentiment pipeline for short quotes — scraped with requests and BeautifulSoup, labeled with VADER, trained with Naive Bayes and Logistic Regression. Evaluated with confusion matrices and word cloud visualizations.

Programming Languages
Python C MATLAB
Tools & Platforms
VS Code Cursor Claude Code Ollama Codex OpenClaw Jupyter Notebook Git GitHub Hugging Face Notion MS Office Google Workspace Docker
Operating Systems
Windows Linux macOS
Human Languages
Burmese Native English Fluent Korean TOPIK 5
Dec 2024 – Present

AI, Software & Prompt Engineer

AIdenBio · Daejeon, KR

  • Designed and maintained backend systems for AI model integration, API development, and database management.
  • Engineered and optimized prompts for LLMs to generate personalized, context-aware responses.
  • Developed and deployed RAG pipelines improving output accuracy for biomedical research workflows.
  • Built production-grade intelligent systems for life sciences applications.
Oct 2025 – Nov 2025

AI Software Engineer Intern

GRINDA AI · Daejeon, KR

Built an AI-powered Slack bot that automatically converts issue reports into structured GitHub issues using Claude AI and FastAPI, with auto-labeling, translation, and monitoring.

B.E. in Electrical Engineering

Chungnam National University · Daejeon, KR

Sept 2022 – Present

Relevant coursework: Computer Programming (C), Linear Algebra, AI and Future Society, Modern Control Theory and Design, Sensor and Measurement Engineering.

Certifications
HarvardX: CS50 Python HarvardX: CS50x HarvardX: CS50 AI with Python