Skip to content

FreedomIntelligence/Awesome-Specialized-Medical-LLMs

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 

Repository files navigation

Awesome-Specialized-Medical-LLMs🧑‍⚕️

This repository provides a curated collection of research on Specialized Medical Large Language Models (SMed-LLMs) for specific diseases and medical specialties, organized by ICD-10 chapters.

📋Table of contents


📽️Visualization

  • Annotated human body diagram illustrating LLMs in 45 specific diseases across 17 organ systems, including female-specific conditions. Organ systems are color-coded; disease names are in bold italics, followed by the corresponding model names.

  • Summary of all specialized medical LLMs for specific diseases and distinct medical specialties collected in this study, categorized by ICD-10 chapter list; disease names and specialty names are highlighted, the corresponding model names are listed.


Specific Diseases

Diseases Paper Submitted in Description Repo/Demo
Tuberculosis Transforming Tuberculosis Care: Optimizing Large Language Models for Enhanced Clinician-Patient Communication GenAI4Health @ AAAI 2025 Optimized a conversational AI for Spanish-speaking TB patients, focusing on cultural relevance, empathy, medical accuracy, and privacy. -
Advancing Chronic Tuberculosis Diagnostics Using Vision-Language Models: A Multi modal Framework for Precision Analysis arXiv 2025/03 Vision-language model using SIGLIP and Gemma-3b integrates chest X-rays and clinical data for accurate, automated chronic TB detection and reporting. -
HIV Enhanced Language Models for Predicting and Understanding HIV Care Disengagement: A Case Study in Tanzania Research Square 2025/05 Fine-tuned LLaMA 3.1 on Tanzanian EMR data for accurate, interpretable prediction of HIV care disengagement. -

Neoplasms (II)

Medical Specialities

Speciality Paper Submitted in Description Repo/Demo
Oncology Radonc-gpt: A large language model for radiation oncology arXiv 2023/09 Instruction-tuned LLM for radiotherapy plan generation and decision support. -
Oncogpt: A medical conversational model tailored with oncology domain expertise on a large language model meta-ai (llama) arXiv 2024/02 Multi-stage fine-tuned LLM for oncology Q&A and treatment recommendations. OncoGPT
SEETrials: Leveraging large language models for safety and efficacy extraction in oncology clinical trials Informatics in Medicine Unlocked 2024 GPT-4 plus prompts for automated extraction of clinical trial outcomes in oncology. -
LLM-driven multimodal target volume contouring in radiation oncology Nature Communications 2024 Multimodal LLM framework for 3D target volume segmentation in radiotherapy. LLMSeg
A vision-language foundation model for precision oncology Nature 2025 Unified vision-language model for multimodal cancer detection and biomarker prediction. MUSK
Cancer Cancerllm: A large language model in cancer domain arXiv 2024/06 Mistral-style LLM pre-trained and fine-tuned for cancer phenotype extraction and diagnosis. -
Breast Medicine Burextract-llama: An llm for clinical concept extraction in breast ultrasound reports Multimedia Computing for Health and Medicine 2024 Q-LoRA fine-tuned Llama3 model for structured information extraction from breast ultrasound. -

Specific Diseases

Diseases Paper Submitted in Description Repo/Demo
Pancreatic Cancer MiniGPT-Pancreas: Multimodal Large Language Model for Pancreas Cancer Classification and Detection arXiv 2024/12 Multimodal LLM integrating CT and prompts for pancreatic cancer classification and detection. MiniGPT-Pancreas
Prostate Cancer RadOnc-GPT (gpt-4o) versus human data extraction for prostate cancer clinical research American Society of Clinical Oncology 2025 Instruction-tuned Llama2 automates radiotherapy regimens and clinical report generation. -
Hepatocellular Carcinoma ChatExosome: An Artificial Intelligence (AI) Agent Based on Deep Learning of Exosomes Spectroscopy for Hepatocellular Carcinoma (HCC) Diagnosis Analytical Chemistry 2025 Fuses exosome Raman spectra transformer with RAG-LLM for HCC diagnosis and Q&A. ChatExosome
Lung Cancer EXACT-Net: EHR-guided lung tumor auto-segmentation for non-small cell lung cancer radiotherapy arXiv 2024/02 Combines LLM-based EHR extraction with 3D U-Net for CT-based lung tumor segmentation. -
TCMLCM: an intelligent question-answering model for traditional Chinese medicine lung cancer based on the KG2TRAG method Digital Chinese Medicine 2025/03 Fine-tuned ChatGLM2-6B with TCM lung cancer data and knowledge graphs using the KG2TRAG method for accurate, professional QA in TCM lung cancer. -
Thyroid Nodules EndoGPT: A Proof-of-concept Large Language Model Based Assistant for the Management of Thyroid Nodules medRxiv 2024 GPT-4o with RAG and prompts for individualized thyroid nodule assessment and management. EndoGPT
Colorectal Cancer Frontiers in intelligent colonoscopy arXiv 2024/10 Multimodal LLM for interactive colonoscopy scene classification and visual-language reasoning. ColonGPT
Breast Cancer Breast-Crag: A Breast Cancer Large Language Model Leveraging Retrieval-Augmented Generation SSRN 5052341 LoRA-finetuned Qwen2.5 and RAG for breast cancer Q&A and exam tasks. Breast-Crag
LLaVA-MultiMammo: adapting vision-language models for explainable and comprehensive multiview mammogram analysis in breast cancer assessment SPIE Medical Imaging 2025 Adapts LLaVA VLM to integrate multi-view mammograms and clinical text for explainable multi-task breast cancer analysis, outperforming task-specific models in density and malignancy classification. -
Cervical Cancer Context-Aware Text-Assisted Multimodal Framework for Cervical Cytology Cell Diagnosis and Chatting IEEE ICME 2024 Integrates multimodal image-text transformers and LLM for cervical cytology classification. -
Thyroid Cancer Thyro-GenAI: A Chatbot Using Retrieval-Augmented Generative Models for Personalized Thyroid Disease Management Journal of Clinical Medicine 2025 Developed a RAG-based chatbot for personalized thyroid disease decision support, showing higher clinical accuracy and reliability than general LLMs. -

Reference Awesome-repo


Endocrine, Nutritional and Metabolic Diseases (IV)

Specific Diseases

Diseases Paper Submitted in Description Repo/Demo
Diabetes Integrated image-based deep learning and language models for primary diabetes care Nature Medicine 2024 Vision transformer + LLM for fundus image analysis, DR grading, and personalized diabetes care. DeepDR-LLM
Diabetica: Adapting Large Language Model to Enhance Multiple Medical Tasks in Diabetes Care and Management arXiv 2024/09 Diabetes-specific LLM with LoRA/SFT for precise Q&A, patient consultation, and record summary. Diabetica
PIRsuader: A Persuasive Chatbot for Mitigating Psychological Insulin Resistance in Type-2 Diabetic Patients COLING 2025 Developed a persuasive LLM-based chatbot that uses dialog act schema and reinforcement learning to counsel T2D patients and reduce psychological insulin resistance. -
DiabetIQ: An Intelligent Diabetes ManagemenApplication with an Integrated LLM-AugmentedRAG Chatbot and ML-Based Risk Early Prediction ResearchGate Technical Report 2025 Developed an intelligent diabetes management app integrating an LLM-augmented RAG chatbot for reliable advice and an ML module for early risk prediction, providing personalized and explainable support to patients. -

Mental and Behavioural Disorders (V)

Medical Specialities

Speciality Paper Submitted in Description Repo/Demo
Psychiatry Psy-llm: Scaling up global mental health psychological services with ai-based large language models arXiv 2023/07 Pre-trained and fine-tuned on psychological Q&A datasets, delivers expert-level answers and urgent screening. PsyQA
Chatcounselor: A large language models for mental health support arXiv 2023/09 LLaMA-7B fine-tuned to provide professional counseling responses and mental health classification. ChatPsychiatrist
Mindwatch: A smart cloud-based ai solution for suicide ideation detection leveraging large language models medRxiv 2023 Fine-tuned transformer for suicide ideation detection, Llama2-RAG for personalized psychoeducation and plans. -
MentaLLaMA: interpretable mental health analysis on social media with large language models ACM Web Conference 2024 LLaMA2 with instruction tuning for detecting and explaining mental health conditions in social media. MentalLLaMA
CBT-LLM: A Chinese large language model for cognitive behavioral therapy-based mental health question answering arXiv 2024/03 Chinese LLM instruction-tuned on CBT QA, delivers structured CBT-based mental health support. CBT-LLM
WundtGPT: Shaping Large Language Models To Be An Empathetic, Proactive Psychologist arXiv 2024/06 LLaMA3-8B with instruction tuning and RLHF (KTO) to enhance empathy, generate diagnoses and counseling. WundtLLaMA

Specific Diseases

Diseases Paper Submitted in Description Repo/Demo
Depression Detecting signs of depression from social media text using RoBERTa pre-trained language models LT-EDI-ACL 2022 Fine-tuned RoBERTa for detecting and quantifying depression in social media text. depression-detection-lt-edi-2022
VS-LLM: Visual-Semantic Depression Assessment Based on LLM for Drawing Projection Test PRCV 2024 Analyzes projection drawings to extract visual-semantic features of depression. -
InterMind: A Doctor-Patient-Family Interactive Depression Assessment System Empowered by Large Language Models arXiv 2024/09 Instruction-tuned LLM with RAG for interactive, multi-party depression assessment and personalized intervention. -
Autism Chatasd: Llm-based ai therapist for asd Digital TV & Wireless Multimedia Communications 2023 Fine-tuned multimodal LLM for ASD knowledge dissemination, auxiliary diagnosis, and intervention. -
SocialRecNet: A Multimodal LLM-Based Framework for Assessing Social Reciprocity in Autism Spectrum Disorder ICASSP 2025 Multimodal LLM integrating speech and text to assess social reciprocity and predict ADOS scores for ASD. -

Reference Awesome-repo


Diseases of the Nervous System (VI)

Medical Specialities

Speciality Paper Submitted in Description Repo/Demo
Neurology Neura: a specialized large language model solution in neurology medRxiv 2024 Retrieval-augmented LLM with memory modules for complex clinical reasoning and differential diagnosis in neurology. -
ExKG-LLM: Leveraging Large Language Models for Automated Expansion of Cognitive Neuroscience Knowledge Graphs arXiv 2025/03 LLMs for automated named entity recognition and knowledge graph expansion in cognitive neuroscience literature. -
Neurosurgery AtlasGPT: dawn of a new era in neurosurgery for intelligent care augmentation, operative planning, and performance Journal of Neurosurgery 2024 RAG-based LLM grounded in neurosurgical literature for precise surgical decision support and clinical summaries. -
LLM4DEU: Fine Tuning Large Language Model for Medical Diagnosis in Outpatient and Emergency Department Visits of Neurosurgery Tsinghua Science and Technology 2025 Proposes LLM4DEU, a fine-tuned ChatGLM-based LLM for neurosurgical diagnosis in outpatient and emergency settings, achieving state-of-the-art accuracy, notably improving prediction for rare diseases over strong baselines. -

Specific Diseases

Diseases Paper Submitted in Description Repo/Demo
Stroke MBBo-RPSLD: Training a Multimodal BlenderBot for Rehabilitation in Post-Stroke Language Disorder IEEE J Biomed Health Informatics 2025 Multimodal encoding and conversational generation for personalized speech rehab in post-stroke aphasia. -
Parkinson’s Disease Autohealth: Advanced llm-empowered wearable personalized medical butler for parkinson’s disease management IEEE CCWC 2024 LLM-powered assistant fusing wearable and speech data for individualized Parkinson’s detection and management. -
Alzheimer’s Disease DALK: Dynamic Co-Augmentation of LLMs and KG to answer Alzheimer's Disease Questions with Scientific Literature arXiv 2024/05 Builds a disease-specific knowledge graph using LLMs to enhance retrieval and Q&A for Alzheimer’s. DALK
DECT: Harnessing LLM-assisted Fine-Grained Linguistic Knowledge and Label-Switched and Label-Preserved Data Generation for Diagnosis of Alzheimer's Disease arXiv 2025/02 Fine-tuned BioBERT extracts fine-grained linguistic features from speech for Alzheimer’s detection. -
AD-GPT: Large Language Models in Alzheimer's Disease arXiv 2025/04 Stacked BERT-Llama3 model for Alzheimer’s genetic information retrieval and gene-disease relationship analysis. -
AD-AGENT: A Multi-agent Framework for End-to-end Anomaly Detection arXiv 2025/05 Proposed an LLM-driven multi-agent system that turns natural language instructions into executable anomaly detection pipelines across multiple libraries and data modalities, making AD accessible for non-experts. AD-AGENT
Ad-autogpt: An autonomous gpt for alzheimer’s disease infodemiology PLOS Global Public Health 2025 Langchain and GPT-4-based agent automates news collection and topic analysis for Alzheimer’s infodemiology. AD-AutoGPT
ADAgent: LLM Agent for Alzheimer's Disease Analysis with Collaborative Coordinator arXiv 2025/06 Developed an extensible LLM agent integrating multiple specialized tools for multi-modal Alzheimer’s diagnosis and prognosis, achieving state-of-the-art accuracy. -
Reasoning-Based Approach with Chain-of-Thought for Alzheimer's Detection Using Speech and Large Language Models arXiv 2025/06 Proposed a speech-to-text LLM framework with Chain-of-Thought reasoning for Alzheimer’s detection, achieving state-of-the-art accuracy and efficiency. -
Vestibular Schwannoma neuroGPT-X: toward a clinic-ready large language model Journal of Neurosurgery 2023 RAG-enhanced GPT model with domain-specific literature and conversational memory for point-of-care support. -
Epilepsy EpilepsyLLM: Domain-specific large language model fine-tuned with epilepsy medical knowledge arXiv 2024/01 LLaMA-based LLM fine-tuned on specialized instruction datasets to improve epilepsy domain expertise. -
Chronic Vertigo Classification of Chronic Dizziness Using Large Language Models Journal of Healthcare Informatics Research 2025 LLM-driven feature extraction and interpretable ML for automated classification of chronic vertigo etiologies. -

Reference Awesome-repo


Diseases of the Eye and Adnexa (VII)

Medical Specialities

Speciality Paper Submitted in Description Repo/Demo
Ophthalmology Ophtha-llama2: A large language model for ophthalmology arXiv 2023/12 LoRA fine-tuning on clinical reports for ophthalmic impression generation from imaging. -
OphGLM: An ophthalmology large language-and-vision assistant Artificial Intelligence in Medicine 2024 Multimodal model for interactive fundus image analysis and Q&A. OphGLM
EYE-Llama, an in-domain large language model for ophthalmology bioRxiv 2024 Two-stage pretraining and QLoRA fine-tuning for improved ophthalmic QA. EYE-Llama
Eyegpt: Ophthalmic assistant with large language models arXiv 2024/03 Domain-specific fine-tuning and retrieval-augmented generation for ophthalmic Q&A and reasoning. -
Eyefound: a multimodal generalist foundation model for ophthalmic imaging arXiv 2024/05 Masked autoencoder for robust ocular and systemic disease prediction and VQA. -
Visionunite: A vision-language foundation model for ophthalmology enhanced with clinical knowledge arXiv 2024/08 Fuses vision encoder and LLM for multimodal, multi-disease diagnosis and clinical explanation. VisionUnite
EyeCLIP: A visual-language foundation model for multi-modal ophthalmic image analysis arXiv 2024/09 CLIP-based multimodal pretraining for zero-shot disease classification, prediction, and VQA. EyeCLIP
Language Enhanced Model for Eye (LEME): An Open-Source Ophthalmology-Specific Large Language Model arXiv 2024/10 Instruction-tuned LLM for ophthalmic QA, diagnosis, and EHR summarization. leme_eye_llm

Specific Diseases

Diseases Paper Submitted in Description Repo/Demo
Glaucoma Xiaoqing: A Q&A model for glaucoma based on LLMs Computers in Biology and Medicine 2024 LoRA fine-tuned ChatGLM-6B with RAG for glaucoma Q&A using specialized and external data. Xiaoqing
Diabetic Retinopathy DR-GPT: A large language model for medical report analysis of diabetic retinopathy patients Plos One 2024 Fine-tuned transformer for automated severity and gradability classification from clinical reports. -
Choroidal and Retinal Diseases ICGA-GPT: report generation and question answering for indocyanine green angiography images British Journal of Ophthalmology 2024 Multimodal LLM for bilingual report generation and Q&A from ICG angiography images. -
RetinalGPT: A Retinal Clinical Preference Conversational Assistant Powered by Large Vision-Language Models arXiv 2025/03 LLaVA-like multimodal model for disease diagnosis, lesion localization, analysis, and dialogue on fundus images. -
Age-related Macular Degeneration Specialized curricula for training vision-language models in retinal image analysis Preprint 2024 Instruction-tuned MiniGPT-4-like model for AMD staging, referral, report generation, and VQA on OCT. SpecialistVLMs

Reference Awesome-repo


Diseases of the Ear and Mastoid Process (VIII)

Medical Specialities

Speciality Paper Submitted in Description Repo/Demo
Otolaryngology ENTAgents: AI Agents for Complex Knowledge Otolaryngology medRxiv 2025 ENTAgents integrates RAG and multi-agent LLMs to enhance clinical reasoning in otolaryngology. -

Specific Diseases

Diseases Paper Submitted in Description Repo/Demo
Vestibular Schwannoma neuroGPT-X: toward a clinic-ready large language model Journal of Neurosurgery 2023 neuroGPT-X augments a GPT-based conversational platform with domain-specific knowledge for vestibular schwannoma management. -

Diseases of the Circulatory System (IX)

Medical Specialities

Speciality Paper Submitted in Description Repo/Demo
Cardiology HuBERT-ECG: a self-supervised foundation model for broad and scalable cardiac applications medRxiv 2024 HuBERT-ECG is a self-supervised foundation model for scalable cardiac tasks based on ECG data. hubert-ecg-base
Zodiac: A Cardiologist-Level LLM Framework for Multi-Agent Diagnostics arXiv 2024/10 Zodiac uses a multi-agent LLM framework for multimodal patient data and cardiologist-level reporting. -
MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report arXiv 2024/10 MoRE enables zero-shot classification and cross-modal retrieval by integrating X-ray, ECG, and report representations. MoRE
CVDLLM: Automated Cardiovascular Disease Diagnosis with Large-Language-Model-Assisted Graph Attentive Feature Interaction IEEE Transactions on Artificial Intelligence 2025 CVDLLM combines time-series neural networks, graph attention, and LLM embeddings for ECG-based multi-disease classification. -
ECG-FM: An Open Electrocardiogram Foundation Model arXiv 2025 Presents ECG-FM, an open transformer-based ECG foundation model pretrained on 1.5M ECGs using hybrid self-supervised learning, achieving state-of-the-art, label-efficient, and robust performance across multiple ECG analysis tasks. ECG-FM
CardioMind - CardioMind is a cardiovascular AI model designed to enhance intelligent medical diagnosis. CardioMind
Internal Medicine Inmd-x: Large language models for internal medicine doctors arXiv 2024/02 InMD-X applies continued pre-training and LoRA-based fine-tuning for robust internal medicine QA. -

Specific Diseases

Diseases Paper Submitted in Description Repo/Demo
Arrhythmia Ecgbert: Understanding hidden language of ecgs with self-supervised representation learning arXiv 2023/06 ECGBERT uses a BERT-style transformer for contextual ECG representation and precise arrhythmia detection.
Ecg semantic integrator (esi): A foundation ecg model pretrained with llm-enhanced cardiological text arXiv 2024/05 ESI integrates RAG and multimodal pretraining to automate ECG description and arrhythmia diagnosis. ESI
Anomalous Aortic Origin of Coronary Arteries LLM-TA: An LLM-Enhanced Thematic Analysis Pipeline for Transcripts from Parents of Children with Congenital Heart Disease arXiv 2025/02 LLM-TA uses a GPT-4o-driven pipeline to extract codes and themes from interview transcripts for AAOCA. LLM-TA

Reference Awesome-repo


Diseases of the Respiratory System (X)

Medical Specialities

Speciality Paper Submitted in Description Repo/Demo
Pulmonology RespLLM: Unifying Audio and Text with Multimodal LLMs for Generalized Respiratory Health Prediction arXiv 2024/10 RespLLM integrates clinical text and respiratory audio signals to automate comprehensive respiratory health screening and diagnosis. RespLLM
LUNG-GPT: Lung sound analysis with LLM-Based model Preprint 2024 LUNG-GPT processes lung sound recordings via Mel-spectrograms and deep learning for disease detection and detailed respiratory event analysis. -
Towards open respiratory acoustic foundation models: Pretraining and benchmarking NeurIPS 2024 The OPERA framework pre-trains three foundation models on 130,000+ respiratory sounds, outperforming general audio models on 16/19 health tasks and showing strong generalizability. OPERA

Specific Diseases

Diseases Paper Submitted in Description Repo/Demo
Asthma AsthmaBot: Multi-modal, Multi-Lingual Retrieval Augmented Generation For Asthma Patient Support arXiv 2024/09 AsthmaBot applies multimodal, multilingual RAG to answer asthma-related questions using text, images, and videos. -
Chronic Lung Disease (COPD) Copd-ChatGLM: A Chronic Obstructive Pulmonary Disease Diagnostic Model IEEE International Conference on Bioinformatics and Biomedicine 2024 Copd-ChatGLM fine-tunes LLMs on patient histories and CT reports for accurate COPD diagnosis and personalized treatment recommendations. -
SpiroLLM: Finetuning Pretrained LLMs to Understand Spirogram Time Series with Clinical Validation in COPD Reporting arXiv 2024/07 Developed the first multimodal LLM that fuses spirogram time-series and PFT data for automated, interpretable COPD report generation, achieving high accuracy and robustness validated on large-scale clinical data. SpiroLLM
Pneumonia PneumoNet: Artificial Intelligence Assistance for Pneumonia Detection on X-Rays Applied Sciences 2025 Developed an AI system with modified AlexNet and GPT-Neo for accurate and explainable pneumonia detection and reporting from X-rays. -
Multimodal model for pneumonia detection based on enhanced stacking MOE IEEE EIECC 2024 Developed a multimodal stacking MOE model with ResNet-50 and BERT, achieving superior pneumonia detection over single-modality models. -
COVID-19 CovidLLM: A Robust Large Language Model with Missing Value Adaptation and Multi-Objective Learning Strategy for Predicting Disease Severity and Clinical Outcomes in COVID-19 Patients arXiv 2024/11 Instruction-tuned LLM (ChatGLM) using prompt-based missing value handling and multi-objective learning to predict COVID-19 severity and outcomes from serological data. CovidLLM
Assessing LLMs to Improve the Prediction of COVID-19 Status Using Microbiome Data Report / Poster 2025 Comparative benchmarking of four transformer-based LLMs (AAM, DNABERT, DNABERT-2, GROVER) for COVID-19 prediction from hospital-derived 16S rRNA microbiome data, demonstrating that domain-specific pretraining (AAM) yields superior predictive performance over general genomic models. COVID-LLM

Diseases of the Digestive System (XI)

Medical Specialities

Speciality Paper Submitted in Description Repo/Demo
Stomatology Cephgpt-4: An interactive multimodal cephalometric measurement and diagnostic system with visual large language model arXiv 2023/07 Multimodal fine-tuning automates cephalometric X-ray analysis and interactive doctor-patient dialogue. -
Dental Loop Chatbot: A Prototype Large Language Model Framework for Dentistry Software 2024 LLaMA2-based chatbot with RAG delivers real-time, guideline-driven clinical decision support for dental care. Dental-Loop-Chatbot
Hepatology Development of a liver disease–specific large language model chat interface using retrieval-augmented generation Hepatology 2024 RAG-integrated LLM provides accurate, knowledge-based Q&A and decision support for liver diseases. -
Gastroenterology GastroBot: a Chinese gastrointestinal disease chatbot based on the retrieval-augmented generation Frontiers in Medicine 2024 RAG-based chatbot with GI disease knowledge base delivers precise, explainable answers and diagnostic advice. ragbot

Specific Diseases

Diseases Paper Submitted in Description Repo/Demo
Periodontal Diseases Development and Comparative Evaluation of a Reinstructed GPT-4o Model Specialized in Periodontology Journal of Clinical Periodontology 2025 GPT-4o enhanced with RAG and knowledge base provides accurate, context-aware answers in periodontology. -

Diseases of the Skin and Subcutaneous Tissue (XII)

Medical Specialities

Speciality Paper Submitted in Description Repo/Demo
Dermatology Pre-trained multimodal large language model enhances dermatological diagnosis using SkinGPT-4 Nature Communications 2024 Multimodal system aligns vision transformer and LLM to diagnose skin images and recommend interactive treatments. SkinGPT-4
SkinGEN: An explainable dermatology diagnosis-to-generation framework with interactive vision-language models IUI 2025 Uses SkinGPT-4 for image-based diagnosis and integrates Stable Diffusion for personalized visual explanations. -
OpenBioLLm-Derm: A Dermatology Large Language Model Based on Llama-3 - Fine-tuned LLaMA model providing accurate, clear, and helpful answers for dermatological Q&A and education. OpenBioLLm-Derm
SkinSavvy2: Augmented Skin Lesion Diagnosis and Personalized Medical Consultation System Electronics 2025 Presents SkinSavvy2, integrating state-of-the-art image classifiers and GPT-4 to provide accurate skin lesion diagnosis and personalized care recommendations. -
MM-Skin: Enhancing Dermatology Vision-Language Model with an Image-Text Dataset Derived from Textbooks arXiv 2025/05 Presents MM-Skin, a comprehensive dermatology image-text dataset, and SkinVL, a fine-tuned vision-language model that sets new benchmarks in dermatology VQA and diagnosis. MM-Skin

Specific Diseases

Diseases Paper Submitted in Description Repo/Demo
Mpox Virus MpoxVLM: A Vision-Language Model for Diagnosing Skin Lesions from Mpox Virus Infection arXiv 2024/11 Multimodal VLM (CLIP, ViT, LLaMA2) jointly analyzes images and clinical info for accurate mpox diagnosis. -

Diseases of the Musculoskeletal System and Connective Tissue (XIII)

Medical Specialities

Speciality Paper Submitted in Description Repo/Demo
Orthopedics Ortho AI: World’s first artificial intelligence in orthopaedics Journal of Orthopaedic Case Reports 2023 Multimodal AI integrates imaging and text for automated bone disease recognition and decision support. -
Orthodoc: Multimodal large language model for assisting diagnosis in computed tomography arXiv 2024/09 Multimodal fine-tuning with RAG and reasoning for orthopedic CT interpretation and diagnostic reporting. -

Specific Diseases

Diseases Paper Submitted in Description Repo/Demo
Rheumatoid Arthritis Hengqin-RA-v1: Advanced Large Language Model for Diagnosis and Treatment of Rheumatoid Arthritis with Dataset based Traditional Chinese Medicine arXiv 2025/01 LoRA-adapted LLM fuses TCM and clinical data for RA diagnosis, syndrome differentiation, and treatment. -
Osteoarthritis Evaluating and Enhancing Large Language Models Performance in Domain-specific Medicine: Osteoarthritis Management with DocOA arXiv 2024/01 RAG-based GPT-4 enables evidence-based Q&A and individualized OA management using external knowledge. DocOA
Spondyloarthritis Assessing and Optimizing Large Language Models on Spondyloarthritis Multi-Choice Question Answering: Protocol for Enhancement and Assessment JMIR Res Protoc 2024/05 Proposes a 222-question SpA benchmark, fine-tunes LLMs with real clinical data, and establishes an evaluation protocol to improve diagnostic accuracy and reasoning for spondyloarthritis. -
Spine SpineGPT: AI assisted total spinal care solution - Fine-tuned multimodal LLM provides diagnosis, counseling, and surgical support for spinal diseases. spineai
Rib Fracture OrthoInsight: Rib Fracture Diagnosis and Report Generation Based on Multi-Modal Large Models arXiv 2025/07 Introduces a multimodal model integrating image detection and medical knowledge to automate rib fracture diagnosis and generate superior CT reports. -
Chronic low back pain (CLBP) Enhancing treatment decision-making for low back pain: a novel framework integrating large language models with retrieval-augmented generation technology Frontiers in Medicine 2025/05 Presents CLBP-ClinicGPT, a hybrid LLM and RAG system with expert-style prompting, delivering superior and personalized treatment recommendations for chronic low back pain over baseline models. -

Diseases of the Genitourinary System (XIV)

Medical Specialities

Speciality Paper Submitted in Description Repo/Demo
Nephrology KidneyTalk-open: No-code Deployment of a Private Large Language Model with Medical Documentation-Enhanced Knowledge Database for Kidney Disease arXiv 2025/03 Integrates LLMs and a nephrology knowledge base for documentation-enhanced QA and decision support in kidney diseases. KidneyTalk-open

Specific Diseases

Diseases Paper Submitted in Description Repo/Demo
Acute Kidney Injury AKIBoards: A Structure-Following Multiagent System for Predicting Acute Kidney Injury arXiv 2025/04 Introduces AKIBoards, a multiagent LLM framework using global structure learning and agent collaboration for more accurate and explainable AKI prediction. -
Kidney Stone Identifying Kidney Stone Risk Factors Through Patient Experiences With a Large Language Model: Text Analysis and Empirical Study Journal of Medical Internet Research 2025/05 Presents KS-GPT, a GPT-4 model with expert-guided prompting to accurately identify known and novel kidney stone risk factors from Chinese social media. -
Chronic Kidney Disease CKD-AI - GPT-4-powered chatbot providing personalized CKD information and self-management guidance. CKD-AI
Kidney Transplantation exKidneyBERT: a language model for kidney transplant pathology reports and the crucial role of extended vocabularies PeerJ Computer Science 2024 Uses an extended Clinical BERT to extract and classify key pathology report information in kidney transplantation. exKidneyBERT

Specific Diseases

Diseases Paper Submitted in Description Repo/Demo
Gestational Diabetes Developing a GraphRAG-enabled local-LLM for Gestational Diabetes Mellitus medRxiv 2025/04 Introduces a GraphRAG-based local LLM that uses knowledge graphs for explainable, accurate, and context-aware decision support in gestational diabetes management. -

Certain Conditions Originating in the Perinatal Period (XVI)

Medical Specialities

Speciality Paper Submitted in Description Repo/Demo
Pediatrics PediatricsGPT: Large language models as chinese medical assistants for pediatric applications NeurIPS 2024 Multi-stage pre-training and instruction tuning for pediatric Q&A, diagnosis, and treatment recommendations. PediatricsGPT
A Medical Multimodal Large Language Model for Pediatric Pneumonia IEEE Journal of Biomedical and Health Informatics 2025 Multimodal encoders and staged training to generate pediatric pneumonia reports from text and images. -
MedicalGLM: A Pediatric Medical Question Answering Model with a quality evaluation mechanism Journal of Biomedical Informatics 2025 Reward modeling and quality-driven fine-tuning for high-quality pediatric medical responses. -
Pediatric Cardiology Development and Validation of a Pediatric Cardiology-Specific Large Language Model Chat Interface using Retrieval Augmented Generation Circulation 2024 Retrieval-augmented generation and prompt engineering for specialized pediatric cardiology Q&A interface. -

Specific Diseases

Diseases Paper Submitted in Description Repo/Demo
Perioperative Sepsis Large language models for predicting perioperative sepsis Applied Intelligence 2025 Presents an interpretable Gemini-based LLM that textualizes perioperative sepsis data for accurate and explainable prediction and treatment support. -

Congenital Malformations, Deformations, and Chromosomal Abnormalities (XVII)

Medical Specialities

Speciality Paper Submitted in Description Repo/Demo
Rare Disease Rare disease diagnosis using knowledge guided retrieval augmentation for ChatGPT Journal of Biomedical Informatics 2024 Retrieval-augmented generation (RAG) enhances ChatGPT for context-aware rare disease diagnosis with explainable reasoning. -
RDguru: a conversational intelligent agent for rare diseases IEEE Journal of Biomedical and Health Informatics 2024 Integrates LangChain-based RAG, ontology-based phenotype annotation, and multi-source fusion for traceable rare disease diagnosis. -
Zebra-Llama: A Context-Aware Large Language Model for Democratizing Rare Disease Knowledge arXiv 2024/11 LoRA fine-tuned LLM with precise RAG pipeline provides accessible, well-cited Ehlers-Danlos Syndrome responses. zebra-llama
RareAgents: Advancing Rare Disease Care through LLM-Empowered Multi-disciplinary Team arXiv 2024/12 Introduces RareAgents, an LLM-based multi-agent framework for rare disease diagnosis and treatment, and the MIMIC-IV-EXT-RARE dataset. -
RDmaster: A novel phenotype-oriented dialogue system supporting differential diagnosis of rare disease Computers in Biology and Medicine 2024 Introduces RDmaster, a web-based Q&A system that enhances rare disease diagnosis by actively collecting key phenotypes and outperforming LLMs and existing tools. -

Specific Diseases

Diseases Paper Submitted in Description Repo/Demo
Congenital Heart Disease Development and Validation of a Pediatric Cardiology-Specific Large Language Model Chat Interface using Retrieval Augmented Generation Circulation 2024 Retrieval-augmented generation and prompt engineering for pediatric cardiology clinical question answering. -

Factors Influencing Health Status and Contact with Health Services (XXI)

Medical Specialities

Speciality Paper Submitted in Description Repo/Demo
Radiology Roentgen: vision-language foundation model for chest x-ray generation arXiv 2022/11 Latent diffusion model for text-to-image chest X-ray synthesis and data augmentation. RoentGen
Xraygpt: Chest radiographs summarization using medical vision-language models arXiv 2023/06 Multimodal architecture aligning MedClip encoder with Vicuna LLM for image-grounded summaries. XrayGPT
CohortGPT: An enhanced gpt for participant recruitment in clinical study arXiv 2023/07 Uses knowledge graphs and dynamic CoT prompting for clinical text classification in participant recruitment. -
Radiology-Llama2: Best-in-class large language model for radiology arXiv 2023/09 Instruction-tuned and LoRA-fine-tuned Llama for radiology report generation. -
ChatRadio-Valuer: A chat large language model for generalizable radiology report generation arXiv 2023/10 Supervised fine-tuning on Llama2 with domain-specific data for radiology report generation. -
Radialog: A large vision-language model for radiology report generation and conversational assistance arXiv 2023/11 Vision-language pipeline for interactive radiology report generation and assistance. RaDialog
Cxr-clip: Toward large scale chest x-ray language-image pre-training MICCAI 2023 CLIP-based vision-language model for zero/few-shot disease classification and retrieval. cxr-clip
R2gengpt: Radiology report generation with frozen llms Meta-Radiology 2023 Vision-language pipeline aligning visual features with LLMs for automated report generation. R2GenGPT
A Vision-Language foundation model to enhance efficiency of chest x-ray interpretation arXiv 2024/01 Foundation vision-language model for comprehensive chest X-ray interpretation. CheXagent
Radiology-GPT: a large language model for radiology Meta-Radiology 2025 LoRA-based fine-tuning on Llama for generating clinical impressions from radiological findings. -
Pathology PathGPT - Fine-tuned Llama-7B for pathology question answering. PathGPT
A visual-language foundation model for computational pathology Nature Medicine 2024 Contrastive vision-language foundation model for histology classification, segmentation, and retrieval. CONCH
Pa-llava: A large language-vision assistant for human pathology image understanding IEEE International Conference on Bioinformatics and Biomedicine 2024 Multimodal assistant for pathology image understanding and visual Q&A via staged training. PA-LLaVA
Anesthesiology Hypnos: A domain-specific large language model for anesthesiology Neurocomputing 2025 Progressively fine-tuned Llama for anesthesia-specific question answering and exam tasks. -

Reference Awesome-repo


Codes for Special Purposes (XXII)

Medical Specialities

Speciality Paper Submitted in Description Repo/Demo
Traditional Chinese Medicine Qibo: A large language model for traditional chinese medicine arXiv 2024/03 Two-phase training with retrieval-augmented prompting for TCM Q&A and prescription entity recognition. -
BianCang: A Traditional Chinese Medicine Large Language Model arXiv 2024/11 Two-stage training on Qwen2/2.5 for improved syndrome differentiation, diagnosis, and Q&A. BianCang-TCM-LLM
Lingdan: enhancing encoding of traditional Chinese medicine knowledge for clinical reasoning tasks with large language models JAMIA 2024 QLoRA fine-tuning and chain-of-thought reasoning for patent medicine Q&A, symptom analysis, and herbal prescription recommendation. LingdanLLM
TCMChat: A generative large language model for traditional Chinese medicine Pharmacological Research 2024 Pre-training and supervised fine-tuning for TCM knowledge Q&A, diagnosis, and formula recommendation. TCMChat
TCM-GPT: Efficient pre-training of large language models for domain adaptation in Traditional Chinese Medicine Computer Methods and Programs in Biomedicine Update 2024 Keyword-driven corpus retrieval and LoRA-based fine-tuning for TCM exams and clinical diagnosis. -
MedChatZH: A tuning LLM for traditional Chinese medicine consultations Computers in Biology and Medicine 2024 Continued pre-training and instruction tuning on Baichuan-7B for TCM Q&A and patient dialogue. MedChatZH
PresRecST: A novel herbal prescription recommendation algorithm for real-world patients with integration of syndrome differentiation and treatment planning Oxford University Press 2024 A knowledge graph-based model for TCM prescription recommendation aligned with clinical practice. PresRecST
Zhongjing: Enhancing the chinese medical capabilities of large language model through expert feedback and real-world multi-turn dialogue AAAI 2024 Continual pre-training and RLHF for multi-turn TCM dialogue, diagnostic support, and drug recommendation. Zhongjing
CPMI-ChatGLM: parameter-efficient fine-tuning ChatGLM with Chinese patent medicine instructions Scientific Reports 2024 Parameter-efficient fine-tuning for patent medicine recommendation and usage instruction automation. CPMI-ChatGLM
TCM-FTP: Fine-Tuning Large Language Models for Herbal Prescription Prediction IEEE International Conference on Bioinformatics and Biomedicine 2024 LoRA-based supervised fine-tuning for herbal prescription and dosage prediction. -
TCM-KLLaMA: Intelligent generation model for Traditional Chinese Medicine Prescriptions based on knowledge graph and large language model Computers in Biology and Medicine 2025 Knowledge graph and synonym matching with LoRA fine-tuning for improved prescription accuracy. -
MCM: Multimodal Chinese Medical Large Model - Continual pre-training and multimodal fusion for comprehensive TCM Q&A, consultation, and knowledge graph construction. -
TCMLLM: Traditional Chinese Medicine Model - Large-scale instruction tuning of ChatGLM for auxiliary diagnosis, syndrome differentiation, and prescription generation. -

Reference Awesome-repo

Star History

Star History Chart

About

A collection of research on specialized medical LLMs for specific diseases and distinct medical specialties, organized by ICD-10 chapters.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages