Pull requests: Lightning-AI/litgpt
Pull requests list
chore(utils): add type annotations to public functions in utils.py
#2237
opened Apr 14, 2026 by
nuthalapativarun
docs(pretrain): add TinyStories pretraining section
#2236
opened Apr 14, 2026 by
nuthalapativarun
fix(model): convert bool mask_cache to float additive mask for softcapping
#2235
opened Apr 14, 2026 by
nuthalapativarun
fix(finetune): all_reduce val_loss for initial/final evaluation in multi-GPU training
#2224
opened Mar 30, 2026 by
NIK-TIGER-BILL
3 tasks done
Fix division by zero in LR scheduler when max_steps equals warmup_steps
#2212
opened Mar 9, 2026 by
Br1an67
Contributor
Fix Mistral tokenizer missing spaces in decode_stream (Issue #1822)
#2211
opened Mar 5, 2026 by
mrshibly
Moving to lazy root imports to make config loading snappy
enhancement
New feature or request
performance
Add Multi-head Latent Attention (DeepSeekv2)
enhancement
New feature or request
waiting on author
#1945
opened Feb 25, 2025 by
simoneangarano
Draft
Raise error if disk is full before downloading weights
bug
Something isn't working
waiting on author
OpenCoder series
enhancement
New feature or request
waiting on author
#1880
opened Dec 21, 2024 by
ysjprojects
Collaborator
Draft
Add LongLora for both full and lora fine-tuning
enhancement
New feature or request
waiting on author