About me

Hello! I'm Tuong, an AI Engineer with a degree in Artificial Intelligence from FPT University, specializing in Computer Vision and Multimodal AI. I'm passionate about research and driven to apply intelligent systems to real-world problems where they can make a difference.

I am currently studying and researching Graph Neural Networks at the Network Science Lab (NS Lab; 네트워크과학연구실), part of the Dept. of AI at The Catholic University of Korea (CUK). During my time at FPT, I engineered advanced OCR solutions, designed document intelligence systems that extract insights from unstructured data, developed 3D reconstruction pipelines, fine-tuned LLMs, and built RAG systems. When I'm not coding, you'll find me playing the piano, exploring creative projects, or discovering new ways AI can transform our world.

What I'm doing

  • AI Research & Development

    Cutting-edge research in Computer Vision, NLP, and Multimodal AI systems with published papers and real-world applications.

  • Machine Learning Engineering

    MLOps implementation, model deployment, and scalable AI solutions using cloud platforms and modern frameworks.

  • 3D Reconstruction & Computer Vision

    Advanced 3D reconstruction pipelines from single/multi-view inputs using generative algorithms and deep learning.

  • Document Intelligence & OCR

    Custom OCR engines, document processing systems, and intelligent data extraction from unstructured content.

Achievements & Recognition

  • FPT Corporation

    Tuong demonstrates exceptional skills in AI engineering, leading multiple projects from 3D reconstruction to LLM deployment. His innovative approach to solving complex problems and his ability to bridge research and practical applications make him an invaluable team member.

  • Research Community

    Academic Recognition

    With 10+ published papers in top conferences and journals, Tuong has established himself as a promising researcher in Computer Vision and Multimodal AI. His work on bee queen detection, multimedia systems, and educational AI applications showcases his diverse expertise.

  • Competition Success

    Competition Excellence

    Multiple hackathon awards, including Second Prize at the FPT Edu Hackathon for Generative AI and a win at DevFest MienTrung 2023. Tuong consistently delivers innovative solutions under pressure, combining technical excellence with creative problem-solving.

  • Community Impact

    Community Leadership

    As admin of Green Chemistry (50K+ likes, 91K+ followers) and co-founder of multiple STEM initiatives, Tuong demonstrates exceptional leadership and commitment to education in the tech community.

Invited Talks

  • Google I/O Extended

    Keynote speaker at Google I/O Extended, sharing insights on cutting-edge AI technologies and their real-world applications in modern software development.

  • UniHack

    Featured speaker at UniHack, discussing innovation in AI engineering and inspiring the next generation of developers with practical insights from industry experience.

  • GDG Mien Trung

    Facilitator at GDG Mien Trung 2025

    Lead facilitator at Google Developer Group Mien Trung 2025, organizing tech workshops and fostering developer community engagement across Central Vietnam.

Resume

Education

  1. Combined M.Eng./Ph.D. in Artificial Intelligence - The Catholic University of Korea

    Mar 2026 — Present

    Supervisor: Professor O-Joun Lee - Network Science Lab (NS Lab)

  2. Bachelor of Artificial Intelligence - FPT University

    2021 — 2025

    GPA: 3.6 • Thesis: "Optimizing Multimedia Query Systems: Applying Multimodal Artificial Intelligence Models to Image and Video Data"

  3. Chemistry - Quoc Hoc Hue High School For The Gifted

    2018 — 2021

    Specialized in Chemistry at one of Vietnam's most prestigious high schools, developing strong analytical and research foundations.

Experience

  1. AI Engineer - FPT Corporation

    Jan 2024 — June 2025

    Led development of 3D reconstruction pipelines, LLM fine-tuning, and RAG systems. Engineered advanced OCR solutions and designed document intelligence systems for automated data extraction.

  2. Research & Development

    2022 — Present

    Published 10+ research papers in Computer Vision, NLP, and Multimodal AI. Contributing to academic conferences and journals with innovative solutions for real-world problems.

  3. Student Researcher

    2021 — 2024

    Focused on Computer Vision and Multimodal AI research during university studies. Won multiple hackathons and competitions while maintaining excellent academic performance.

My skills

  • AI & Machine Learning

    PyTorch, TensorFlow, Computer Vision, NLP, Multimodal AI, Deep Learning

  • Programming Languages

    Python (Advanced), C/C++, JavaScript, SQL

  • MLOps & Deployment

    PyTorch Development, Model Deployment, LLM Fine-tuning, Quantization, TGI, RAG Systems

  • Cloud Platforms

    AWS, Google Cloud, Azure

  • Data & Databases

    Vector Databases, Neo4j, ETL Development, Data Processing

  • Specialized AI

    3D Reconstruction, OCR Systems, Document Intelligence

Competition Excellence

  1. Second Prize - FPT Edu Hackathon for Generative AI

    2024

    Achieved second place in FPT's premier Generative AI hackathon, developing innovative AI solutions under tight deadlines. Demonstrated expertise in prompt engineering, model fine-tuning, and creative application of LLMs for practical problems.

  2. Winner - DevFest MienTrung 2023

    2023

    Won Google Developer Group's flagship competition in Central Vietnam, showcasing technical excellence and innovative problem-solving skills. The project focused on scalable web applications built with modern development frameworks.

  3. Multiple University Hackathons

    2022 — 2024

    Consistent top performer in university-level competitions, including AI/ML challenges, web development contests, and innovation hackathons. Demonstrated ability to rapidly prototype solutions and work effectively in team environments.

  4. Academic Excellence Awards

    2021 — 2025

    Recipient of multiple academic recognition awards during the Bachelor's degree, maintaining a high GPA while actively participating in research projects and publishing papers in international conferences.

Research

Publications

  1. Explainable Intelligence in Digital Twins: State-Of-The-Art and Open Challenges

    2025 Accepted

    EIDT 2025

    A comprehensive survey examining the current state and future challenges of explainable AI in digital twin systems, identifying key research gaps and proposing future directions for the field.

  2. A review on Vision-Language-Based Approaches: Challenges and Applications

    2025 Accepted

    Computers, Materials & Continua

    A comprehensive review exploring the latest developments in Vision-Language models, examining challenges and real-world applications in AI systems.

  3. Bio-Inspired Algorithms in NLP Techniques: challenges, limitations and its applications

    2025 Accepted

    Computers, Materials & Continua

    Exploring bio-inspired computational approaches for natural language processing tasks.

  4. MoviePoster-Grounded contextual visualization using multimodal techniques

    2024 Accepted

    Lecture Notes on Data Engineering and Communications Technologies

    Innovative approach to contextual visualization using multimodal AI techniques for movie poster analysis.

  5. Face Detection Using Eigenfaces: A Comprehensive Review

    2024 Accepted

    IEEE Access

    Comprehensive review of Eigenfaces methodology for face detection, exploring traditional approaches and their evolution in modern computer vision applications.

  6. Mitigating hallucinations in large language models for educational application

    2024 Accepted

    IEEE International Conference on Consumer Electronics-Asia (ICCE-Asia)

    Innovative approaches to reduce hallucinations in LLMs for educational contexts.

  7. Glimpse: A Multimodal-based Transforming Image Collection with Vector Database

    2024 Accepted

    International Conference on Information Networking (ICOIN)

    Novel multimodal system for transforming image collections using vector databases and AI techniques.

  8. A technical review on Bootstrapping Language-Image Pre-training

    2024 Accepted

    DBpia

    Technical review of BLIP (Bootstrapping Language-Image Pre-training) methodology and applications in multimodal AI systems.

  9. Investigating YOLO models for rice seed classification

    2023 Accepted

    Lecture Notes in Networks and Systems

    Application of YOLO object detection models for agricultural purposes, specifically rice seed classification.

  10. Evaluating Audio Feature Extraction Methods for Identifying Bee Queen Presence

    2023 Accepted

    SOICT '23 Conference Proceedings

    Novel application of audio signal processing and machine learning for automated bee queen detection in beekeeping.
