Pinned Loading
Repositories
- avnlp.github.io Public
avnlp/avnlp.github.io’s past year of commit activity - prp Public
Pairwise Ranking Prompting (PRP): Zero-shot LLM reranking library implementing efficient pairwise strategies (Heapsort, Sliding Window, All-Pairs). Mitigates position bias via bidirectional comparison and ensures reliability with structured Pydantic validation. Built for Haystack pipelines.
avnlp/prp’s past year of commit activity - agentic-med-diag Public
Agentic Graph RAG for Medical diagnosis and Clinical QA. Combines LangGraph-orchestrated multi-hop reasoning with parallel semantic/relational retrieval, Neo4j knowledge graphs, and vector search for evidence-grounded answers on MedQA, MedXpertQA, MedCaseReasoning and MMLU-Pro.
avnlp/agentic-med-diag’s past year of commit activity - llm-finetuning Public
Advanced LLM fine-tuning techniques: SFT (LoRA, QLoRA, DoRA, P-/Prefix-Tuning), GRPO, DPO, ORPO, KTO & PPO; composable correctness/format rewards + LLM-as-a-Judge evals (DeepEval, Evidently AI) across math, multi-hop, medical & general QA on Llama 3, Mistral, Phi-4, Gemma & Qwen3. Built on TRL, PEFT & Unsloth.
avnlp/llm-finetuning’s past year of commit activity - llm-blender Public
LLM-Blender: Ensembling framework that maximizes LLM performance via pairwise ranking. Employs PairRanker to rank candidates and GenFuser to merge outputs, generating superior responses by combining the diverse strengths of multiple open-source models.
avnlp/llm-blender’s past year of commit activity - grpo Public
Group Relative Policy Optimization (GRPO) implementations - NanoAhaMoment, GRPO:Zero, Simple GRPO, and GRPO from Scratch - spanning vLLM + DeepSpeed, custom Transformer stack, Bottle HTTP reference server, and pure PyTorch. Compares generation backends, reference policy strategies, reward designs, and loss functions on GSM8K and Countdown tasks.
avnlp/grpo’s past year of commit activity - vectordb Public
Production-ready Haystack/LangChain pipelines for Hybrid & Parent-Child Retrieval, Diversity Filtering, MMR, Metadata Filtering, Reranking, Query Enhancement, Multi-Tenancy, Agentic RAG across Pinecone, Weaviate, Milvus, Qdrant, Chroma with dataset loaders, evaluation tooling, and benchmarks for TriviaQA, ARC, PopQA, FactScore and Earnings Calls.
avnlp/vectordb’s past year of commit activity - rankers Public
Modular LLM ranking library for Information Retrieval and RAG. Implements state-of-the-art Pairwise, Setwise, and Listwise ranking with structured generation and specialized models (RankZephyr, RankLlama). Features efficient sorting algorithms, sliding windows, and zero-shot capabilities.
avnlp/rankers’s past year of commit activity - biothink Public
Self-Reflective Question Answering for Biomedical Reasoning. GRPO fine-tuning via QLoRA & Unsloth with rewards for correctness, relevance, groundness, utility & XML structure. Structured think → answer → self-reflection with context grading, relevance assessment & groundness evaluation. DeepEval LLM-as-a-Judge (GEval, Faithfulness, Relevancy).
avnlp/biothink’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…