ros/nanochat
05a051dbe92c3617529a607d4fb652355bada804
nanochat/scripts
Andrej Karpathy
8892470f29
add the SpellingBee task so that nanochat can count the r's in "strawberry", etc. Along the way we had to add a bunch of new functionality, e.g. extending the calculator to support Python's count function. The current TaskMixture possibly uses far too many synthetic SpellingBee examples, because the eval gives us exactly 100% performance on spelling. We can tune this later to reclaim some wall-clock time here, I think.
2025-10-24 14:02:48 +00:00
base_eval.py
many small tweaks. base, eval, core work now, I think
2025-10-16 15:46:18 -07:00
base_loss.py
many small tweaks. base, eval, core work now, I think
2025-10-16 15:46:18 -07:00
base_train.py
move the lr schedule args up in base_train so they are tunable in configurator
2025-10-24 13:27:31 +00:00
chat_cli.py
upgrading all other files to be able to use cpu/mps as well as cuda. various other minor changes, e.g. changing max_iterations to num_iterations in the sft script for consistent naming
2025-10-20 10:15:17 -07:00
chat_eval.py
add the SpellingBee task so that nanochat can count the r's in "strawberry", etc. Along the way we had to add a bunch of new functionality, e.g. extending the calculator to support Python's count function. The current TaskMixture possibly uses far too many synthetic SpellingBee examples, because the eval gives us exactly 100% performance on spelling. We can tune this later to reclaim some wall-clock time here, I think.
2025-10-24 14:02:48 +00:00
chat_rl.py
initial commit
2025-10-13 06:49:24 -07:00
chat_sft.py
add the SpellingBee task so that nanochat can count the r's in "strawberry", etc. Along the way we had to add a bunch of new functionality, e.g. extending the calculator to support Python's count function. The current TaskMixture possibly uses far too many synthetic SpellingBee examples, because the eval gives us exactly 100% performance on spelling. We can tune this later to reclaim some wall-clock time here, I think.
2025-10-24 14:02:48 +00:00
chat_web.py
upgrading all other files to be able to use cpu/mps as well as cuda. various other minor changes, e.g. changing max_iterations to num_iterations in the sft script for consistent naming
2025-10-20 10:15:17 -07:00
mid_train.py
add the SpellingBee task so that nanochat can count the r's in "strawberry", etc. Along the way we had to add a bunch of new functionality, e.g. extending the calculator to support Python's count function. The current TaskMixture possibly uses far too many synthetic SpellingBee examples, because the eval gives us exactly 100% performance on spelling. We can tune this later to reclaim some wall-clock time here, I think.
2025-10-24 14:02:48 +00:00
tok_eval.py
initial commit
2025-10-13 06:49:24 -07:00
tok_train.py
initial commit
2025-10-13 06:49:24 -07:00
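
The SpellingBee commit message in this listing mentions counting the r's in "strawberry" and extending the calculator to support Python's count function. As a rough illustration of that underlying operation only, here is a minimal sketch using the built-in str.count. The helper name count_letter is hypothetical and is not taken from the nanochat code; how the actual task and calculator are implemented is not shown in this listing.

```python
def count_letter(word: str, letter: str) -> int:
    """Count non-overlapping occurrences of `letter` in `word`.

    Hypothetical helper for illustration; it simply wraps the
    built-in str.count and is case-sensitive.
    """
    return word.count(letter)


if __name__ == "__main__":
    # "strawberry" contains three r's: st-r-awbe-r-r-y
    print(count_letter("strawberry", "r"))  # prints 3
```

Because str.count is case-sensitive, a caller wanting case-insensitive counts would lower-case both arguments first.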