Commit c48bf1b
Mamba new data fixes2 (#149)
* make mamba
* add quick debug
* add quick debug
* revert debug verbosity
* Learning rate scheduler changed (Constant)
* Add AutoHandler
* Add Auto cfg option for AutoHandler
* Len gets called before open
* path/filepath typo fix
* Partitioning fix from mup-search
* Cosine 0.01 decay
* Warmup interval change
* Schedule change
* Constant schedule
* LR schedule change (cool down and constant lr)
* Update dataset_utils.py
Added a check for length of doc
* LR schedule change (Warmup + constant)
* Update dataset_utils.py
* Cosine schedule
* For constant lr 1.5e5
* Schedule change
* Schedule change
* Final singlefile checkpoint saves one folder up (#127)
* Final singlefile checkpoint saves one folder up
Signed-off-by: Davis Wertheimer <[email protected]>
* save file under new pth subfolder
Signed-off-by: Davis Wertheimer <[email protected]>
* Repath for easier consumption/conversion
Signed-off-by: Davis Wertheimer <[email protected]>
---------
Signed-off-by: Davis Wertheimer <[email protected]>
* Added cool down
* length of doc check
* splitstrip cols and pass to fhandler
* fhandler col_names support
* Warmup for annealing
* Debugging
* Debugging II
* Empty shard check
* Added constant lr schedule with warmup
* added print for length of doc
* added print for length of doc II
* Update dataset_utils.py
* Update dataset_utils.py
* Update dataset_utils.py
* Update dataset_utils.py
* Adding print for debug
* Revert "Pulled from data-fixes branch"
This reverts commit ac5194b, reversing
changes made to 1b50708.
reverting changes
* Revert all changes made after March 6 (before merge)
* Revert all changes made after March 6 (before merge)
* removed print
---------
Signed-off-by: Davis Wertheimer <[email protected]>
Co-authored-by: Linsong Chu <[email protected]>
Co-authored-by: divykum2 <[email protected]>
Co-authored-by: divya-kumari32 <[email protected]>
1 parent 6d751e5 commit c48bf1b
File tree
8 files changed (+144, -400 lines)
- fms_fsdp
- config
- utils
- tests