This website requires JavaScript.
Explore
Help
Register
Sign In
ros
/
nanochat
Watch
1
Star
0
Fork
0
You've already forked nanochat
mirror of
https://github.com/karpathy/nanochat.git
synced
2026-01-30 04:22:02 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
e3f58b838e98a5ea013a3c1773fde9d4a3c5d090
nanochat
/
scripts
History
Andrej Karpathy
8203efa919
implement flash attention 3 fallback to pytorch sdpa by touching as few lines of code as possible in main files and keeping all implementation to a single file. add tests. add helpful warning messages for the user.
2026-01-16 17:37:51 +00:00
..
base_eval.py
bugfix
2025-12-26 19:02:12 +08:00
base_loss.py
fix buggy midtrain and update all kwargs to be idiomatic. that is, argparse uses dashes variables use underscores. the underscores are just a remnant of the previous Configurator object. This is the right way
2026-01-13 22:45:27 +00:00
base_train.py
implement flash attention 3 fallback to pytorch sdpa by touching as few lines of code as possible in main files and keeping all implementation to a single file. add tests. add helpful warning messages for the user.
2026-01-16 17:37:51 +00:00
chat_cli.py
upgrading all other files to be able to use cpu/mps as well as cuda. various minor other changes ,e.g. changing max_iterations to num_iterations in sft script for consistency in naming
2025-10-20 10:15:17 -07:00
chat_eval.py
Fix args in readme (
#438
)
2026-01-15 16:26:38 -08:00
chat_rl.py
typo in comments: change "GAPO" to "DAPO"
2026-01-15 22:03:42 -08:00
chat_sft.py
fix buggy midtrain and update all kwargs to be idiomatic. that is, argparse uses dashes variables use underscores. the underscores are just a remnant of the previous Configurator object. This is the right way
2026-01-13 22:45:27 +00:00
chat_web.py
ensure consistency of quotes within each statement
2025-11-03 21:52:02 +01:00
mid_train.py
fix buggy midtrain and update all kwargs to be idiomatic. that is, argparse uses dashes variables use underscores. the underscores are just a remnant of the previous Configurator object. This is the right way
2026-01-13 22:45:27 +00:00
tok_eval.py
initial commit
2025-10-13 06:49:24 -07:00
tok_train.py
fix buggy midtrain and update all kwargs to be idiomatic. that is, argparse uses dashes variables use underscores. the underscores are just a remnant of the previous Configurator object. This is the right way
2026-01-13 22:45:27 +00:00