Bowen Xue

📄 Curriculum Vitae

🎓 Education

University of Science and Technology of China (USTC)

Sep. 2023 – Present

Undergraduate Student in Computer Science and Technology

Expected Graduation: June 2027

💼 Internships and Projects

MIT HAN Lab Intern

Apr. 2025 – Present

Conducting research on generative model acceleration at MIT HAN Lab under the guidance of PhD student Muyang Li.

  • Core contributor to nunchaku (2.6K⭐) and ComfyUI-nunchaku (1.8K⭐) Integrated PuLID into nunchaku, enabling extreme acceleration and memory compression for Flux1.dev-based PuLID with minimal quality loss
  • Developed corresponding ComfyUI nodes
  • Conducting research on quantization training of diffusion models

    AIGC, Diffusion, Quantization, Acceleration

    Tencent Intern

    Nov. 2024 – Sept. 2025

    Worked at WXG, researching and implementing the latest image/video face-swapping algorithms.

    • First author of Stand-In: A Lightweight and Plug-and-Play Identity Control for Video Generation, achieving superior facial similarity and naturalness with significant efficiency gains. Released code and project page. Supports face swapping, non-human identity-preserving generation, stylized video generation, and pose-driven generation
    • Designed new face fusion algorithm for WeChat Channels, surpassing IP-Adapter, InstantID, PuLID; reduced inference time from 20s to 5s
    • Performed large-scale fine-tuning of SDXL for high-quality Asian portraits

    T2I, T2V, IP2V

    IDEA ReadPaper Intern

    Aug. 2024 – Nov. 2024

    Designed and built academic graph systems and disambiguation algorithms for LLM reasoning and paper recommendation.

    • Developed knowledge graph schema for academic resources
    • Built entity disambiguation and deduplication systems
    • Integrated into production at www.readpaper.com

    Graph datasets, RAG

    LoRAExpand

    Nov. 2023 – Jun. 2024

    Built a model capable of generating landscape paintings while placing specific calligraphy in designated regions.

    • Trained LoRA with Chinese landscape dataset
    • Integrated SVD for text-to-video rendering
    • Combined with AnyText for dynamic calligraphy placement
    • Deployed UI with Gradio
    • Code: GitHub

    Diffusion, LoRA, AnyText, Gradio

    Deepin AI QAbot

    May 2024 – Aug. 2024

    Built a lightweight RAG system to answer domain-specific queries over structured documents.

    • Designed RAG pipeline with custom retriever + reader + answerer
    • Deployed with Gradio UI and minimal hardware requirement

    LLM, RAG, Gradio

    🛠 Technical Skills

    • Languages: Python
    • Platforms: Linux, Windows

    ℹ️ Additional Information

    • Research internship with Prof. Xiangnan He (USTC)
    • Proficient in LaTeX, Git, and Docker
    • Strong ability in reading and writing academic papers in English