CV
General Information
| Full Name | Le Duc Anh Tuan (Charles) |
| Languages | English, Vietnamese |
Education
-
Oct. 2020 - Aug. 2024
BSc. in Data Science and Artificial Intelligence
Hanoi University of Science and Technology, Hanoi, Vietnam
Publications
- Speechless (Interspeech, 2025): Speech Instruction Training Without Speech for Low Resource Languages
- Poseless (arXiv, 2025): Depth-Free Vision-to-Joint Control via Direct Image Mapping with Vision Language Model
Experience
-
May 2025 - Oct. 2025
GPU Engineer
Moreh Inc, Seoul, South Korea
- Implemented a pure HIP C++ version of OpenAI’s MoE GPT-OSS from scratch (without rocBLAS/hipBLAS); optimized model loading, continuous batching, multi-streaming, multi-GPU communication, CPU–GPU–SRAM memory access, FlashAttention, MFMA GEMM, and MoE load balancing; achieved 30k TPS (20B) and 10k TPS (120B) on a single node with 8× AMD MI250X GPUs.
-
Nov. 2024 - May 2025
LLM Researcher
Menlo Research, Singapore, Singapore
- Developed a lightweight speech tokenizer (22M) using Residual Vector Quantization, achieving SOTA results on viVoice and LibriSpeech by extending the codebook size to 2048 and using a two-phase training process (KL+CE and CE loss); contributed to the open-source WhisperSpeech codebase; outperformed the original Whisper and PhoWhisper
- Researched the Speechless model, a modified Llama 3.2 1B architecture, to generate synthetic semantic audio representations from multimodal inputs using ASR datasets; paper accepted at Interspeech 2025 (Rank A)
- Modified the Llama tokenizer and performed continual pre-training on semantic tokens from ASR datasets, followed by post-training with mixed raw text, sound-text, and noise sound datasets (filtered by language identification, deduplication, length, and quality) to align with user preferences
- Published Ichigo as a modular package on PyPI, supporting an asynchronous API for platform developers and implementing overlapping audio chunking to support long audio input for ASR
-
Apr. 2024 - Nov. 2024
Data Scientist
Viettel Group, Hanoi, Vietnam
- Developed a multi-agent Conversational Recommendation System with multimodal capabilities, supporting vision input and speech-to-speech interaction with end-users
- Implemented AdaptiveICL to align with pre-defined expertise plans and designed a synthetic data pipeline for fine-tuning reasoning, SQL query generation, and function calling
- Built Retrieval, Ranking, and Query tools for database interaction; implemented a Candidate Bus for item candidate storage and Web Search for external resource integration
- Engineered an end-to-end system, including a Docker-wrapped API to bridge Application and Infrastructure layers
-
Sep. 2022 - Aug. 2024
Organizer
Google Developer Groups, Hanoi, Vietnam
- Planned and created project blueprints, delegated tasks to individuals, connected teams, and provided future operational guidance, certified by Google's Global Headquarters
-
Sep. 2023 - Dec. 2023
Applied Scientist
VinBrain (NVIDIA), Hanoi, Vietnam
- Developed MedNeXt for rectal cancer diagnosis and treatment support
-
Jan. 2022 - Sep. 2023
Research Assistant
Data Science Laboratory, HUST, Vietnam
- Researched rehearsal-free, class-specific prompts for continual object detection
-
Mar. 2023 - Sep. 2023
Computer Vision Researcher
Viettel High Tech, Hanoi, Vietnam
- Designed the 3DNeRV architecture utilizing efficient cube-wise embedding for hybrid video representation, enabling parallel frame processing, simultaneous multi-frame reconstruction, and resolution flexibility
-
Sep. 2022 - Sep. 2023
Head of Event & PR Team
Google Developer Student Clubs, HUST, Vietnam
-
Aug. 2022 - Jan. 2023
Machine Learning Researcher
LEAN Platform, Southeast Asia
- Developed a real-time Conv3D model for concentration recognition on an online self-learning platform with 10,000 users
Projects
-
2025
Leo
- Architected an LLMOps system for a personal AI assistant, encompassing Data, Feature, Training, Inference, and Observation components and following clean architecture principles
- Implemented an offline pipeline that retrieves data from data services and stores it on S3; designed an ETL pipeline to crawl links and perform quality filtering; set up a feature generation pipeline that builds fine-tuning datasets and creates vector embeddings indexed in MongoDB for Hybrid Contextual Retrieval; and established a training pipeline with evaluation and model serving on HF/AWS endpoints, all orchestrated by ZenML
- Designed an online pipeline featuring an agentic RAG system served via API using LiteLLM; the system uses summarization and retrieval tools (powered by fine-tuned LLM endpoints and a vector index database), supports a Search MCP server, and adds observability through prompt monitoring and RAG evaluation
-
2024
Gemini Omni
- Developed a real-time web application showcased at Google I/O Extended Hanoi, featuring speech-to-speech functionality, multimodal integration, and RAG for event updates
-
2023
Google I/O Extended Hanoi Website
- Designed front-end client and admin back-end websites, deployed hosting, and tracked event emails
Skills
- Areas of Interest: Multimodal LLMs, Multi-agent Systems, LLM Systems, High Performance Computing
- Programming Languages: Python, C++, CUDA, HIP, Triton, TypeScript, Java, JavaScript, SQL
- LLMOps: Docker, Kubernetes, AWS S3/Bedrock/SageMaker, MLflow, Airflow, ZenML, Weaviate, WandB
- Frameworks: PyTorch, TensorFlow, Hugging Face, vLLM, SGLang, LlamaFactory, ONNX
Volunteering
- Google Developer Group Hanoi: Speaker; presented "Detecting Cheating in Examinations" at DevFest 2022
- SheCodes Vietnam: AI Mentor of SheCodes Hackathon Hanoi 2023
- Nestle: Ambassador of MT SparkTheNext Leaders Program 2023, 2024
- AIESEC: Representative of Mini Leadership Conference 2022
- VinAI Research: Technical Collaborator at AI Day 2022
Achievements
- Top 1: Viettel Digital Talent 2024 (Data Science and Artificial Intelligence)
- Third Prize: Excellent Students Contest in Math 2019 (Provincial Merit Competition)
- Winner: Innovation Lab Asia CrowdPitch
- Best Incubatee: TechYouth (VinUniversity)
- CCMG Grantee: Cyberport (2nd largest incubator in Hong Kong with 5 unicorns)
- Top 1: X-Challenge by VCCorp
- Top 1: Prometheus in Digital Transformation by European Union (out of 4000 teams)
- Top 1: Business Challenge 6 by Vietnam National University
- Top 5: Youth Impact Entrepreneurs by PNJ (VN30 Index)
- Top 6: Hult Prize Asia Summit 2022 (out of 1000 teams)
- Top 20: University Startup World Cup (out of 5000 teams)
- Top 30: Moonshot Global (out of 3000 teams)
- Top 100: XPITCH Global (out of 4000 teams)