Skip to content
View gapsong's full-sized avatar
😇
😇

Block or report gapsong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
gapsong/README.md

ML Engineer · LLM Efficiency & Quantization

I work at the intersection of extreme model compression and production-grade fine-tuning.

What I actually do: I make large language models run on hardware that wasn't supposed to support them — without destroying what makes them useful.

Shipped:

  • 🔧 Official QA-LoRA implementation in Hugging Face PEFT → PR #2571 · PR #2664
  • 📄 Master Thesis @ TU Berlin (supervised by Prof. Samek & Prof. Müller, Fraunhofer HHI): "Accelerating Quantization-Aware Training of 2-bit Compact LLMs" → Proposed SA-SVD: -63% training VRAM vs. standard LoRA, +150 perplexity points recovery on broken 2-bit models → Proposed DRA: error-based adapter initialization for parameter-efficient fine-tuning on resilient architectures
  • 🧪 SA-SVD reference implementation (open source, reproducible) → gapsong/sa-svd-qa-lora → Measured at 2-bit across three LLMs: better WikiText perplexity on every model tested (-15% to -48%) at identical training budget — and it makes Qwen2-1.5B trainable where standard random-init QA-LoRA diverges with inf gradients

Stack: PyTorch · Hugging Face (PEFT, Transformers, TRL) · GPTQ · bitsandbytes · Slurm · CUDA · AWS

Pinned Loading

  1. sa-svd-qa-lora sa-svd-qa-lora Public

    Structure-aware SVD initialization for QA-LoRA fine-tuning of 2-bit LLMs. Measured on 3 models: lower WikiText perplexity on every model tested (-15% to -48%), and makes Qwen2-1.5B trainable where …

    Python

  2. huggingface/peft huggingface/peft Public

    🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

    Python 21.3k 2.3k