Issues: Lightning-AI/litgpt

Issues list

Errors when try to save checkpoints during the full fine-tuning (labels: question)
#1764 opened Oct 1, 2024 by lemon-awa
Molmo support (labels: enhancement)
#1763 opened Sep 30, 2024 by win4r
Issue with Dolly Dataloader: context key not found! (labels: bug)
#1760 opened Sep 28, 2024 by pytholic
TypeError: TextInputSequence must be str (labels: bug)
#1759 opened Sep 28, 2024 by hemanth
Support 128k token version of Phi 3 (labels: enhancement, model-weights)
#1739 opened Sep 24, 2024 by rasbt
Question about tie_embeddings (labels: question)
#1727 opened Sep 14, 2024 by twaka
Data Loading bug in pretrain on resume over multiple epochs (labels: bug)
#1712 opened Sep 7, 2024 by fdalvi
Qwen series (labels: model-weights, question)
#1709 opened Sep 3, 2024 by Godlikemandyy
[BUG] LLaMA 3.1 RoPE (labels: bug, question)
#1699 opened Aug 28, 2024 by zzhhjjj
Microsoft Phi 3.5 MoE (labels: enhancement, model-weights)
#1686 opened Aug 21, 2024 by rasbt
attention mask is incorrect when generate with softcapping (labels: bug)
#1672 opened Aug 13, 2024 by twaka
Disable KV cache option (labels: enhancement)
#1671 opened Aug 12, 2024 by rasbt
Gemma 2B weights seem to have changed (labels: bug)
#1665 opened Aug 8, 2024 by rasbt
Tensor parallelism generates non-sensical outputs (labels: bug)
#1663 opened Aug 8, 2024 by rasbt
Use FlexAttention (labels: enhancement, performance)
#1662 opened Aug 8, 2024 by rasbt
TPU Pod Training (labels: question)
#1643 opened Jul 30, 2024 by opooladz
access hidden layer(s) from a model (labels: question)
#1642 opened Jul 30, 2024 by Byungsooo
Implement prompt caching to speed up inference (labels: enhancement)
#1638 opened Jul 27, 2024 by rasbt
Skip safetensors->bin file conversion (labels: enhancement)
#1625 opened Jul 24, 2024 by rasbt