revamped val (bootstrap ci to support seq and tok intervals, many perplexity evals) #142
Conversation
Left a couple of comments, not blocking.
parser.add_argument("--eps", type=float, default=None, help="Adam epsilon.") | ||
parser.add_argument("--lr", type=float, default=5.0e-4, help="Learning rate.") | ||
parser.add_argument("--beta1", type=float, default=0.9, help="Adam beta 1.") | ||
parser.add_argument("--beta2", type=float, default=0.95, help="Adam beta 2.") |
We're changing the default beta2 here compared to before, right?
Yeah, but a beta2 of 0.95 has been pretty standard for most open_lm runs so far.
"loss_upper_95": upper, | ||
"loss_sequences_lower_95": lower_seq, | ||
"loss_sequences_upper_95": upper_seq, | ||
"loss_tokens_lower_95": lower_tok, |
Should we also save the loss tokens mean?
Given that we never see partial sequences, I think this will just be loss.
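A minimal sketch of how the sequence-level vs. token-level bootstrap intervals from the diff above could be computed; the function and variable names here are hypothetical and not taken from the PR:

```python
import numpy as np

def bootstrap_loss_cis(per_seq_loss_sum, per_seq_tok_count, n_boot=1000, alpha=0.05, seed=0):
    """Bootstrap 95% CIs for the mean loss, resampling whole sequences.

    Sequence-level interval: mean of per-sequence mean losses.
    Token-level interval: total loss / total tokens (token-weighted mean).
    """
    rng = np.random.default_rng(seed)
    loss_sum = np.asarray(per_seq_loss_sum, dtype=float)    # summed loss per sequence
    tok_count = np.asarray(per_seq_tok_count, dtype=float)  # token count per sequence
    n = len(loss_sum)

    seq_means, tok_means = [], []
    for _ in range(n_boot):
        idx = rng.integers(0, n, size=n)  # resample sequences with replacement
        seq_means.append(np.mean(loss_sum[idx] / tok_count[idx]))
        tok_means.append(loss_sum[idx].sum() / tok_count[idx].sum())

    lo, hi = 100 * alpha / 2, 100 * (1 - alpha / 2)
    lower_seq, upper_seq = np.percentile(seq_means, [lo, hi])  # loss_sequences_{lower,upper}_95
    lower_tok, upper_tok = np.percentile(tok_means, [lo, hi])  # loss_tokens_{lower,upper}_95
    return (lower_seq, upper_seq), (lower_tok, upper_tok)
```

When there are no partial sequences, every tok_count entry is identical, so the token-weighted mean coincides with the plain per-sequence mean, which is why a separate "loss tokens mean" would just duplicate loss.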
Minor nit, but LGTM.
open_lm/main.py
if is_master(args):
    with fsspec.open(os.path.join(args.checkpoint_path, "results.jsonl"), "a") as f:
        f.write(json.dumps(evaluation_metrics))
It might be good to do this as one write: since we're going over fsspec, a network error could leave us with a corrupt jsonl or something (without the newline).
As in, do the dumps + \n at once so it's idempotent and keeps the jsonl consistent. In general, append mode is scary.
sg
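A minimal sketch of the single-write suggestion, reusing is_master, args, and evaluation_metrics from the diff hunk above rather than defining them here:

```python
import json
import os

import fsspec

if is_master(args):
    # Build the complete record (JSON plus trailing newline) before opening the file,
    # so the append is a single write and an interrupted transfer cannot leave a
    # half-written line without its newline.
    record = json.dumps(evaluation_metrics) + "\n"
    with fsspec.open(os.path.join(args.checkpoint_path, "results.jsonl"), "a") as f:
        f.write(record)
```

Whether an fsspec backend can still truncate a single write mid-record depends on the filesystem, but keeping the newline in the same write at least avoids the missing-newline case raised above.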