Skip to content
View bdqnghi's full-sized avatar
🎱
Focusing
🎱
Focusing

Block or report bdqnghi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
bdqnghi/README.md

Hi, I'm Nghi πŸ‘‹

πŸ“ USA | πŸ€– Googler | 🧠 AI Research Scientist |

Python PyTorch TensorFlow Transformers LLMs Google Tree-sitter GNNs

Building intelligent coding agents and pushing the boundaries of AI for software engineering β€” from LLMs post-training to multi-agent systems that write real-world software.

Featured R&D Projects

  • πŸ—οΈ OpenDev - Open-source Coding Agent for the Terminal (470+ ⭐)
  • 🧬 SWE-EVO - A Challenging Benchmark for Coding Agents in the Software Evolution Scenarios
  • πŸ“š CodeWiki - Open-source DeepWiki: holistic repo-level documentation across multilingual codebases (760+ ⭐)
  • πŸ“š TheVault - Open-source Code-Comment Dataset or Instruction Tuning (109+ ⭐)
  • πŸ•΅οΈ HyperAgent - Generalist software agents to solve software engineering tasks (235 ⭐)
  • πŸ‘₯ AgileCoder - Agile methodology meets multi-agent systems for building real-world software (451 ⭐)
  • 🦫 CodeCapybara - Open-source self-instruction tuning Code LLM (171 ⭐)
  • πŸ–₯️ XMainframe - Language model for mainframe modernization (COBOL β†’ modern code) (68 ⭐)
  • πŸ”§ CodeTF - One-stop Transformer library for state-of-the-art Code LLMs (1.5k+ ⭐)
  • πŸš€ CodeT5 - Open Code LLMs for code understanding and generation (3k+ ⭐)

(Old Research)

  • 🧠 InferCode - Self-supervised learning of code representations (89 ⭐)
  • 🌳 AST Node Encoding - AST node vector embeddings for source code
  • πŸ”€ Bi-TBCNN - Bilateral tree-based convolutional neural network for code
  • πŸ“Š Graph-AST - Graph representations of source code
  • πŸ”— GGNN.TensorFlow - Gated graph neural networks for code classification
  • πŸ“– Awesome AI4Code - Curated list of AI4Code papers and datasets

GitHub Activity

GitHub Contribution Graph

What I'm Working On

  • Coding Agents - Building AI agents that can autonomously understand, navigate, and modify codebases
  • Code LLMs - Training and fine-tuning large language models specialized for code
  • Multi-Agent Systems - Orchestrating multiple AI agents for complex software engineering tasks
  • Code Representation - Learning deep representations of source code via graphs, trees, and transformers

Connect

Website Google Scholar GitHub

Pinned Loading

  1. opendev-to/opendev opendev-to/opendev Public

    Open-Source Coding Agent in the terminal

    Rust 522 64

  2. FSoft-AI4Code/CodeWiki FSoft-AI4Code/CodeWiki Public

    [ACL 2026] Open-source framework for holistic, structured repository-level documentation across multilingual codebases

    Python 879 134

  3. FSoft-AI4Code/AgileCoder FSoft-AI4Code/AgileCoder Public

    [FORGE 2025] Incorporating Agile methodology into agents to create complex real-world softwares

    Python 456 58

  4. FSoft-AI4Code/HyperAgent FSoft-AI4Code/HyperAgent Public

    Generalist Software Agents to Solve Soware Engineering Tasks

    Python 241 23

  5. SWE-EVO/SWE-EVO SWE-EVO/SWE-EVO Public

    Python 39 9

  6. FSoft-AI4Code/TheVault FSoft-AI4Code/TheVault Public

    [EMNLP 2023] The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation

    Jupyter Notebook 105 10