[Minor] Expose is_leaf_nontensor and default_is_leaf #1219
Merged
vmoens added a commit that referenced this pull request on Feb 17, 2025
ghstack-source-id: aeaa6981081b45430f6664530f3dab7cc3d21759
Pull Request resolved: #1219
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 49.2520μs | 21.2272μs | 47.1094 KOps/s | 46.7318 KOps/s | |
test_plain_set_stack_nested | 53.5690μs | 21.3297μs | 46.8830 KOps/s | 46.9381 KOps/s | |
test_plain_set_nested_inplace | 50.7940μs | 23.4996μs | 42.5540 KOps/s | 43.2980 KOps/s | |
test_plain_set_stack_nested_inplace | 65.6320μs | 23.0580μs | 43.3689 KOps/s | 43.5251 KOps/s | |
test_items | 26.5990μs | 4.1880μs | 238.7760 KOps/s | 241.9067 KOps/s | |
test_items_nested | 0.5300ms | 0.4097ms | 2.4405 KOps/s | 2.4671 KOps/s | |
test_items_nested_locked | 0.8495ms | 0.4116ms | 2.4297 KOps/s | 2.4745 KOps/s | |
test_items_nested_leaf | 0.1410ms | 79.5916μs | 12.5641 KOps/s | 12.7050 KOps/s | |
test_items_stack_nested | 0.5753ms | 0.4126ms | 2.4237 KOps/s | 2.4519 KOps/s | |
test_items_stack_nested_leaf | 0.1551ms | 81.0299μs | 12.3411 KOps/s | 12.6706 KOps/s | |
test_items_stack_nested_locked | 0.4947ms | 0.4117ms | 2.4290 KOps/s | 2.4723 KOps/s | |
test_keys | 21.2190μs | 3.5450μs | 282.0851 KOps/s | 284.4843 KOps/s | |
test_keys_nested | 0.2890ms | 0.1703ms | 5.8729 KOps/s | 6.0305 KOps/s | |
test_keys_nested_locked | 1.9607ms | 0.1773ms | 5.6408 KOps/s | 5.7813 KOps/s | |
test_keys_nested_leaf | 0.2262ms | 0.1475ms | 6.7807 KOps/s | 6.8408 KOps/s | |
test_keys_stack_nested | 0.2988ms | 0.1699ms | 5.8849 KOps/s | 5.9509 KOps/s | |
test_keys_stack_nested_leaf | 0.2363ms | 0.1468ms | 6.8108 KOps/s | 6.8339 KOps/s | |
test_keys_stack_nested_locked | 0.2694ms | 0.1741ms | 5.7445 KOps/s | 5.7847 KOps/s | |
test_values | 5.6464μs | 1.0275μs | 973.2267 KOps/s | 950.0069 KOps/s | |
test_values_nested | 0.1175ms | 64.6174μs | 15.4757 KOps/s | 15.6761 KOps/s | |
test_values_nested_locked | 0.1475ms | 65.2150μs | 15.3339 KOps/s | 15.5070 KOps/s | |
test_values_nested_leaf | 0.1382ms | 74.0232μs | 13.5093 KOps/s | 13.5853 KOps/s | |
test_values_stack_nested | 0.1166ms | 65.8706μs | 15.1813 KOps/s | 15.7153 KOps/s | |
test_values_stack_nested_leaf | 0.1401ms | 74.5110μs | 13.4208 KOps/s | 13.6839 KOps/s | |
test_values_stack_nested_locked | 0.1146ms | 65.4194μs | 15.2860 KOps/s | 14.8117 KOps/s | |
test_membership | 23.7140μs | 0.8675μs | 1.1527 MOps/s | 1.0992 MOps/s | |
test_membership_nested | 22.4120μs | 2.8751μs | 347.8093 KOps/s | 345.5078 KOps/s | |
test_membership_nested_leaf | 23.7740μs | 2.8873μs | 346.3410 KOps/s | 340.1063 KOps/s | |
test_membership_stacked_nested | 43.5110μs | 2.8647μs | 349.0758 KOps/s | 343.3871 KOps/s | |
test_membership_stacked_nested_leaf | 23.6430μs | 2.8630μs | 349.2794 KOps/s | 344.8807 KOps/s | |
test_membership_nested_last | 23.1030μs | 4.3384μs | 230.5020 KOps/s | 227.8447 KOps/s | |
test_membership_nested_leaf_last | 25.7480μs | 4.2740μs | 233.9715 KOps/s | 227.6509 KOps/s | |
test_membership_stacked_nested_last | 31.2780μs | 4.2976μs | 232.6890 KOps/s | 227.7432 KOps/s | |
test_membership_stacked_nested_leaf_last | 36.3770μs | 4.2756μs | 233.8831 KOps/s | 225.7237 KOps/s | |
test_nested_getleaf | 45.8860μs | 10.9371μs | 91.4323 KOps/s | 92.8928 KOps/s | |
test_nested_get | 34.7440μs | 10.4466μs | 95.7246 KOps/s | 98.6612 KOps/s | |
test_stacked_getleaf | 58.5010μs | 10.7984μs | 92.6061 KOps/s | 95.7431 KOps/s | |
test_stacked_get | 29.7750μs | 10.3158μs | 96.9385 KOps/s | 100.5358 KOps/s | |
test_nested_getitemleaf | 46.7660μs | 11.6890μs | 85.5509 KOps/s | 87.8458 KOps/s | |
test_nested_getitem | 53.5800μs | 10.9149μs | 91.6183 KOps/s | 92.2016 KOps/s | |
test_stacked_getitemleaf | 33.6220μs | 11.5402μs | 86.6535 KOps/s | 88.1860 KOps/s | |
test_stacked_getitem | 35.5060μs | 10.9645μs | 91.2036 KOps/s | 91.5472 KOps/s | |
test_lock_nested | 0.5690ms | 0.4058ms | 2.4642 KOps/s | 2.4491 KOps/s | |
test_lock_stack_nested | 0.6447ms | 0.4208ms | 2.3763 KOps/s | 2.3641 KOps/s | |
test_unlock_nested | 0.4309ms | 0.3353ms | 2.9827 KOps/s | 2.9883 KOps/s | |
test_unlock_stack_nested | 0.5326ms | 0.3433ms | 2.9128 KOps/s | 2.9093 KOps/s | |
test_flatten_speed | 0.1979ms | 0.1031ms | 9.6989 KOps/s | 9.7116 KOps/s | |
test_unflatten_speed | 0.9192ms | 0.5252ms | 1.9040 KOps/s | 1.8763 KOps/s | |
test_common_ops | 6.1426ms | 0.8119ms | 1.2316 KOps/s | 1.2180 KOps/s | |
test_creation | 24.7360μs | 2.4713μs | 404.6499 KOps/s | 401.3232 KOps/s | |
test_creation_empty | 45.4240μs | 13.6454μs | 73.2849 KOps/s | 82.5030 KOps/s | |
test_creation_nested_1 | 47.6990μs | 16.6405μs | 60.0944 KOps/s | 66.7315 KOps/s | |
test_creation_nested_2 | 51.6160μs | 20.9913μs | 47.6388 KOps/s | 51.0928 KOps/s | |
test_clone | 65.9720μs | 13.2899μs | 75.2449 KOps/s | 73.3426 KOps/s | |
test_getitem[int] | 0.9862ms | 12.6835μs | 78.8425 KOps/s | 80.7598 KOps/s | |
test_getitem[slice_int] | 0.1491ms | 24.4304μs | 40.9327 KOps/s | 41.0105 KOps/s | |
test_getitem[range] | 0.1617ms | 50.4870μs | 19.8071 KOps/s | 20.0669 KOps/s | |
test_getitem[tuple] | 0.1319ms | 19.7963μs | 50.5145 KOps/s | 49.6555 KOps/s | |
test_getitem[list] | 0.1797ms | 45.8480μs | 21.8112 KOps/s | 22.1242 KOps/s | |
test_setitem_dim[int] | 48.4300μs | 25.6079μs | 39.0505 KOps/s | 37.0240 KOps/s | |
test_setitem_dim[slice_int] | 80.6400μs | 51.5488μs | 19.3991 KOps/s | 18.5661 KOps/s | |
test_setitem_dim[range] | 0.1062ms | 76.8880μs | 13.0059 KOps/s | 12.7011 KOps/s | |
test_setitem_dim[tuple] | 63.9990μs | 40.6238μs | 24.6161 KOps/s | 23.4366 KOps/s | |
test_setitem | 76.6420μs | 21.0801μs | 47.4380 KOps/s | 46.7253 KOps/s | |
test_set | 90.6780μs | 20.5567μs | 48.6460 KOps/s | 48.3575 KOps/s | |
test_set_shared | 4.1750ms | 0.1832ms | 5.4579 KOps/s | 5.3985 KOps/s | |
test_update | 0.1034ms | 24.4713μs | 40.8642 KOps/s | 41.4404 KOps/s | |
test_update_nested | 95.4370μs | 34.7243μs | 28.7983 KOps/s | 28.1484 KOps/s | |
test_update__nested | 0.5449ms | 33.7226μs | 29.6537 KOps/s | 29.0773 KOps/s | |
test_set_nested | 63.5580μs | 22.6753μs | 44.1008 KOps/s | 43.2874 KOps/s | |
test_set_nested_new | 85.5390μs | 27.7115μs | 36.0862 KOps/s | 35.4401 KOps/s | |
test_select | 0.1089ms | 44.5902μs | 22.4265 KOps/s | 22.0998 KOps/s | |
test_select_nested | 0.2877ms | 63.5438μs | 15.7372 KOps/s | 15.9531 KOps/s | |
test_exclude_nested | 0.1592ms | 81.5129μs | 12.2680 KOps/s | 12.3276 KOps/s | |
test_empty[True] | 0.6024ms | 0.4187ms | 2.3883 KOps/s | 2.4283 KOps/s | |
test_empty[False] | 11.7060μs | 1.3852μs | 721.9127 KOps/s | 727.6986 KOps/s | |
test_unbind_speed | 0.3521ms | 0.2696ms | 3.7090 KOps/s | 3.7053 KOps/s | |
test_unbind_speed_stack0 | 0.4848ms | 0.2667ms | 3.7498 KOps/s | 3.7335 KOps/s | |
test_unbind_speed_stack1 | 0.1002s | 0.7270ms | 1.3756 KOps/s | 1.2453 KOps/s | |
test_split | 0.1002s | 1.7268ms | 579.1196 Ops/s | 569.2583 Ops/s | |
test_chunk | 0.1025s | 1.7341ms | 576.6726 Ops/s | 628.3475 Ops/s | |
test_consolidate_njt[False-None] | 8.6634ms | 8.3824ms | 119.2974 Ops/s | 111.7083 Ops/s | |
test_creation[device0] | 0.2159ms | 89.5176μs | 11.1710 KOps/s | 11.2229 KOps/s | |
test_creation_from_tensor | 3.3996ms | 96.3463μs | 10.3792 KOps/s | 10.6688 KOps/s | |
test_add_one[memmap_tensor0] | 0.1262ms | 4.7839μs | 209.0343 KOps/s | 200.6198 KOps/s | |
test_contiguous[memmap_tensor0] | 19.6260μs | 0.5246μs | 1.9061 MOps/s | 1.9194 MOps/s | |
test_stack[memmap_tensor0] | 28.3930μs | 3.3329μs | 300.0372 KOps/s | 292.8272 KOps/s | |
test_memmaptd_index | 1.2636ms | 0.2340ms | 4.2730 KOps/s | 4.3187 KOps/s | |
test_memmaptd_index_astensor | 0.5040ms | 0.3193ms | 3.1314 KOps/s | 3.1431 KOps/s | |
test_memmaptd_index_op | 1.1751ms | 0.6122ms | 1.6334 KOps/s | 1.6664 KOps/s | |
test_serialize_model | 0.2065s | 0.1295s | 7.7208 Ops/s | 8.6201 Ops/s | |
test_serialize_model_pickle | 0.4552s | 0.3951s | 2.5310 Ops/s | 2.4870 Ops/s | |
test_serialize_weights | 0.1231s | 0.1135s | 8.8086 Ops/s | 8.8651 Ops/s | |
test_serialize_weights_returnearly | 0.1768s | 0.1586s | 6.3044 Ops/s | 5.9350 Ops/s | |
test_serialize_weights_pickle | 1.1056s | 0.7025s | 1.4234 Ops/s | 2.4094 Ops/s | |
test_serialize_weights_filesystem | 0.2517s | 0.1550s | 6.4515 Ops/s | 7.1017 Ops/s | |
test_serialize_model_filesystem | 0.1425s | 0.1379s | 7.2519 Ops/s | 6.6872 Ops/s | |
test_reshape_pytree | 64.8510μs | 25.9452μs | 38.5428 KOps/s | 38.0699 KOps/s | |
test_reshape_td | 77.3740μs | 31.7489μs | 31.4971 KOps/s | 31.1392 KOps/s | |
test_view_pytree | 70.8420μs | 26.0008μs | 38.4603 KOps/s | 38.7224 KOps/s | |
test_view_td | 94.1750μs | 39.7481μs | 25.1584 KOps/s | 25.4387 KOps/s | |
test_unbind_pytree | 0.1043ms | 29.6941μs | 33.6767 KOps/s | 34.1375 KOps/s | |
test_unbind_td | 0.3462ms | 39.7592μs | 25.1514 KOps/s | 25.2462 KOps/s | |
test_split_pytree | 71.5930μs | 29.0147μs | 34.4653 KOps/s | 34.3976 KOps/s | |
test_split_td | 0.5719ms | 46.0727μs | 21.7048 KOps/s | 22.0939 KOps/s | |
test_add_pytree | 86.3710μs | 35.6242μs | 28.0708 KOps/s | 27.7961 KOps/s | |
test_add_td | 0.1240ms | 59.7767μs | 16.7289 KOps/s | 17.3543 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1274ms | 66.7266μs | 14.9865 KOps/s | 15.1111 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3796ms | 0.1713ms | 5.8369 KOps/s | 5.7761 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1098ms | 45.6572μs | 21.9023 KOps/s | 22.0146 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.1952ms | 0.1178ms | 8.4874 KOps/s | 8.5142 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 75.5910μs | 27.9802μs | 35.7396 KOps/s | 36.3470 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1434ms | 59.1803μs | 16.8975 KOps/s | 16.5729 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1582ms | 81.6022μs | 12.2546 KOps/s | 12.5025 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1574ms | 68.2651μs | 14.6488 KOps/s | 14.9641 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1771ms | 0.1040ms | 9.6112 KOps/s | 9.5434 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3284ms | 0.2167ms | 4.6154 KOps/s | 4.6605 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 98.7530μs | 46.8183μs | 21.3592 KOps/s | 21.6259 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1938ms | 67.0742μs | 14.9089 KOps/s | 14.7588 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2168ms | 99.4213μs | 10.0582 KOps/s | 9.9861 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3288ms | 0.1979ms | 5.0533 KOps/s | 4.9065 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4835ms | 0.2343ms | 4.2676 KOps/s | 4.2905 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2068ms | 0.1068ms | 9.3593 KOps/s | 9.3307 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1206ms | 63.0673μs | 15.8561 KOps/s | 15.9152 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1077ms | 48.0030μs | 20.8320 KOps/s | 20.8799 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.2629ms | 0.1580ms | 6.3280 KOps/s | 6.3146 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1749ms | 99.2902μs | 10.0715 KOps/s | 9.9744 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 51.5450μs | 20.9975μs | 47.6247 KOps/s | 46.9655 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1631ms | 67.0846μs | 14.9066 KOps/s | 14.4338 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1647ms | 82.3283μs | 12.1465 KOps/s | 12.2870 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1475ms | 68.5762μs | 14.5823 KOps/s | 14.9105 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.2988ms | 0.2111ms | 4.7363 KOps/s | 4.6017 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.7926ms | 1.3793ms | 725.0074 Ops/s | 723.1493 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3012ms | 0.2017ms | 4.9569 KOps/s | 4.6776 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 0.9887ms | 0.8063ms | 1.2402 KOps/s | 1.1710 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.6720ms | 0.4440ms | 2.2524 KOps/s | 2.1788 KOps/s | |
test_compile_assign_and_add_stack[eager] | 4.3536ms | 2.8266ms | 353.7826 Ops/s | 365.1607 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 80.1390μs | 38.2321μs | 26.1560 KOps/s | 25.5042 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5330ms | 33.0461μs | 30.2608 KOps/s | 30.5907 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 76.5720μs | 30.5800μs | 32.7011 KOps/s | 32.2185 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 60.6630μs | 23.2365μs | 43.0357 KOps/s | 43.5235 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 87.8440μs | 31.3203μs | 31.9282 KOps/s | 31.9224 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 69.4390μs | 23.3958μs | 42.7428 KOps/s | 43.6060 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1040ms | 52.8486μs | 18.9220 KOps/s | 18.8593 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.3649ms | 20.1255μs | 49.6883 KOps/s | 49.7020 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1074ms | 45.4157μs | 22.0188 KOps/s | 21.3758 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 47.8680μs | 18.8790μs | 52.9690 KOps/s | 53.9868 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1034ms | 46.6027μs | 21.4580 KOps/s | 21.2677 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 89.3060μs | 19.0404μs | 52.5200 KOps/s | 54.5201 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1548ms | 54.0732μs | 18.4935 KOps/s | 18.6210 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.8925ms | 20.0199μs | 49.9504 KOps/s | 51.2991 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1495ms | 45.5263μs | 21.9653 KOps/s | 21.4946 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 56.3550μs | 18.4843μs | 54.1000 KOps/s | 54.2035 KOps/s | |
test_compile_indexing[int-pytree-compile] | 96.7390μs | 45.9953μs | 21.7414 KOps/s | 21.3112 KOps/s | |
test_compile_indexing[int-pytree-eager] | 51.1450μs | 18.5614μs | 53.8753 KOps/s | 54.7310 KOps/s | |
test_mod_add[eager] | 0.1066ms | 36.7164μs | 27.2358 KOps/s | 28.4236 KOps/s | |
test_mod_add[compile] | 0.1354ms | 62.2541μs | 16.0632 KOps/s | 15.7406 KOps/s | |
test_mod_add[compile-overhead] | 0.1191ms | 61.6373μs | 16.2240 KOps/s | 15.8302 KOps/s | |
test_mod_wrap[eager] | 0.4098ms | 0.2228ms | 4.4892 KOps/s | 4.4617 KOps/s | |
test_mod_wrap[compile] | 1.3476ms | 0.2242ms | 4.4605 KOps/s | 4.4178 KOps/s | |
test_mod_wrap[compile-overhead] | 0.4063ms | 0.2208ms | 4.5287 KOps/s | 4.4279 KOps/s | |
test_mod_wrap_and_backward[eager] | 15.5718ms | 12.6401ms | 79.1135 Ops/s | 92.9399 Ops/s | |
test_mod_wrap_and_backward[compile] | 13.8956ms | 11.2612ms | 88.8004 Ops/s | 90.1647 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 12.6304ms | 11.0723ms | 90.3156 Ops/s | 89.8755 Ops/s | |
test_seq_add[eager] | 0.2043ms | 0.1221ms | 8.1897 KOps/s | 8.2497 KOps/s | |
test_seq_add[compile] | 0.1573ms | 73.9757μs | 13.5180 KOps/s | 13.2087 KOps/s | |
test_seq_add[compile-overhead] | 0.1456ms | 72.6494μs | 13.7647 KOps/s | 13.3181 KOps/s | |
test_seq_wrap[eager] | 0.6411ms | 0.4471ms | 2.2366 KOps/s | 2.1516 KOps/s | |
test_seq_wrap[compile] | 0.3518ms | 0.2400ms | 4.1675 KOps/s | 4.1052 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3713ms | 0.2406ms | 4.1562 KOps/s | 4.1204 KOps/s | |
test_func_call_runtime[False-eager] | 0.9375ms | 0.5373ms | 1.8611 KOps/s | 1.8106 KOps/s | |
test_func_call_runtime[False-compile] | 0.7839ms | 0.4409ms | 2.2682 KOps/s | 2.2555 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5433ms | 0.4384ms | 2.2808 KOps/s | 2.2640 KOps/s | |
test_func_call_runtime[True-eager] | 0.9607ms | 0.7486ms | 1.3358 KOps/s | 1.3160 KOps/s | |
test_func_call_runtime[True-compile] | 0.8262ms | 0.4542ms | 2.2015 KOps/s | 2.1734 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.6269ms | 0.4544ms | 2.2009 KOps/s | 2.1693 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.6956ms | 0.5322ms | 1.8789 KOps/s | 1.8277 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.5270ms | 0.4361ms | 2.2932 KOps/s | 2.2665 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.7328ms | 0.4358ms | 2.2947 KOps/s | 2.2575 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1401ms | 0.8908ms | 1.1226 KOps/s | 1.0879 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.9401ms | 0.7808ms | 1.2807 KOps/s | 1.2217 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.4211ms | 0.7882ms | 1.2687 KOps/s | 1.2012 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.4971ms | 1.8609ms | 537.3832 Ops/s | 515.9601 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9015ms | 0.5295ms | 1.8887 KOps/s | 1.8513 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 2.3439ms | 0.5278ms | 1.8946 KOps/s | 1.8496 KOps/s | |
test_distributed | 0.2858ms | 0.1222ms | 8.1804 KOps/s | 7.8680 KOps/s | |
test_tdmodule | 50.9450μs | 27.0018μs | 37.0346 KOps/s | 36.3926 KOps/s | |
test_tdmodule_dispatch | 78.0450μs | 49.1941μs | 20.3276 KOps/s | 20.5425 KOps/s | |
test_tdseq | 44.7630μs | 29.0291μs | 34.4482 KOps/s | 34.7011 KOps/s | |
test_tdseq_dispatch | 81.6720μs | 55.0930μs | 18.1511 KOps/s | 17.9601 KOps/s | |
test_instantiation_functorch | 1.7881ms | 1.5148ms | 660.1461 Ops/s | 633.1129 Ops/s | |
test_exec_functorch | 0.2504ms | 0.1762ms | 5.6766 KOps/s | 5.5047 KOps/s | |
test_exec_functional_call | 0.3443ms | 0.1698ms | 5.8886 KOps/s | 5.7433 KOps/s | |
test_exec_td_decorator | 0.4489ms | 0.2307ms | 4.3354 KOps/s | 4.2523 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.9002ms | 0.6496ms | 1.5395 KOps/s | 1.4964 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.4046ms | 0.6492ms | 1.5403 KOps/s | 1.5021 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7518ms | 0.5209ms | 1.9197 KOps/s | 1.8516 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8284ms | 0.5220ms | 1.9156 KOps/s | 1.8571 KOps/s | |
test_to_module_speed[True] | 2.2391ms | 1.3369ms | 747.9811 Ops/s | 754.9518 Ops/s | |
test_to_module_speed[False] | 2.2848ms | 1.2970ms | 771.0261 Ops/s | 763.0112 Ops/s | |
test_tc_init | 98.3020μs | 48.3317μs | 20.6904 KOps/s | 21.7447 KOps/s | |
test_tc_init_nested | 0.5714ms | 95.0479μs | 10.5210 KOps/s | 10.7447 KOps/s | |
test_tc_first_layer_tensor | 16.2400μs | 1.5360μs | 651.0563 KOps/s | 627.4183 KOps/s | |
test_tc_first_layer_nontensor | 16.6510μs | 4.6447μs | 215.2980 KOps/s | 207.0199 KOps/s | |
test_tc_second_layer_tensor | 21.1190μs | 2.8628μs | 349.3118 KOps/s | 340.0462 KOps/s | |
test_tc_second_layer_nontensor | 23.3630μs | 6.0055μs | 166.5128 KOps/s | 161.8942 KOps/s | |
test_unbind | 0.2255s | 14.8000ms | 67.5675 Ops/s | 61.6251 Ops/s | |
test_full_like | 9.3955ms | 7.9911ms | 125.1391 Ops/s | 128.3165 Ops/s | |
test_zeros_like | 10.4469ms | 4.6526ms | 214.9322 Ops/s | 225.6641 Ops/s | |
test_ones_like | 6.4055ms | 3.3201ms | 301.1973 Ops/s | 306.0819 Ops/s | |
test_clone | 9.3727ms | 6.7404ms | 148.3582 Ops/s | 200.4146 Ops/s | |
test_squeeze | 58.7090μs | 12.5515μs | 79.6716 KOps/s | 79.2912 KOps/s | |
test_unsqueeze | 0.1718ms | 93.8308μs | 10.6575 KOps/s | 10.8749 KOps/s | |
test_split | 0.4745ms | 0.1932ms | 5.1766 KOps/s | 5.1405 KOps/s | |
test_permute | 0.2893ms | 0.1972ms | 5.0698 KOps/s | 4.9841 KOps/s | |
test_stack | 28.6086ms | 24.9678ms | 40.0516 Ops/s | 40.4200 Ops/s | |
test_cat | 31.7545ms | 24.8555ms | 40.2325 Ops/s | 40.5024 Ops/s |
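
The Change column is empty in the table above, so the difference between the two throughput columns has to be worked out by hand. The sketch below is one way to do that, assuming that "Ops" is the measurement for this pull request, that "Ops on Repo HEAD" is the baseline, and that the change is the relative difference between the two. The helper names `parse_ops` and `relative_change` are hypothetical and are not part of the benchmark tooling; the three sample rows are copied from the table.

```python
# Hypothetical helpers for reading the throughput cells of the table above.
# Assumption: change = (Ops - Ops on Repo HEAD) / Ops on Repo HEAD.

# Multipliers for the unit suffixes that appear in the table.
_UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_ops(cell: str) -> float:
    """Convert a cell such as '47.1094 KOps/s' to operations per second."""
    value, unit = cell.split()
    return float(value) * _UNIT_SCALE[unit]


def relative_change(ops_pr: str, ops_head: str) -> float:
    """Relative throughput difference; positive means the PR column is faster."""
    pr = parse_ops(ops_pr)
    head = parse_ops(ops_head)
    return (pr - head) / head


# (Ops, Ops on Repo HEAD) pairs copied from three rows of the table.
rows = {
    "test_plain_set_nested": ("47.1094 KOps/s", "46.7318 KOps/s"),
    "test_creation_empty": ("73.2849 KOps/s", "82.5030 KOps/s"),
    "test_membership": ("1.1527 MOps/s", "1.0992 MOps/s"),
}

changes = {name: relative_change(*cells) for name, cells in rows.items()}

for name, change in changes.items():
    print(f"{name}: {change:+.2%}")
```

Single-run differences of a few percent are often within timing noise, so rows with a large gap between Max and Mean are worth re-running before drawing conclusions from them.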

Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 37.6200μs | 13.1839μs | 75.8502 KOps/s | 79.0120 KOps/s | |
test_plain_set_stack_nested | 39.1910μs | 13.2333μs | 75.5670 KOps/s | 78.5419 KOps/s | |
test_plain_set_nested_inplace | 60.7510μs | 14.3912μs | 69.4867 KOps/s | 72.5460 KOps/s | |
test_plain_set_stack_nested_inplace | 41.1310μs | 14.2322μs | 70.2634 KOps/s | 73.2372 KOps/s | |
test_items | 41.2000μs | 2.8973μs | 345.1459 KOps/s | 336.5108 KOps/s | |
test_items_nested | 0.4016ms | 0.3700ms | 2.7025 KOps/s | 2.7622 KOps/s | |
test_items_nested_locked | 0.4531ms | 0.3719ms | 2.6889 KOps/s | 2.7323 KOps/s | |
test_items_nested_leaf | 90.5510μs | 60.9913μs | 16.3958 KOps/s | 16.5023 KOps/s | |
test_items_stack_nested | 0.4096ms | 0.3711ms | 2.6944 KOps/s | 2.7604 KOps/s | |
test_items_stack_nested_leaf | 89.2110μs | 61.9345μs | 16.1461 KOps/s | 16.0982 KOps/s | |
test_items_stack_nested_locked | 0.4070ms | 0.3696ms | 2.7054 KOps/s | 2.7849 KOps/s | |
test_keys | 36.6800μs | 3.4624μs | 288.8199 KOps/s | 287.9163 KOps/s | |
test_keys_nested | 0.1296ms | 90.4288μs | 11.0584 KOps/s | 11.4132 KOps/s | |
test_keys_nested_locked | 0.7610ms | 96.0496μs | 10.4113 KOps/s | 10.7131 KOps/s | |
test_keys_nested_leaf | 0.1180ms | 81.7969μs | 12.2254 KOps/s | 12.6564 KOps/s | |
test_keys_stack_nested | 0.1368ms | 91.6382μs | 10.9125 KOps/s | 11.2892 KOps/s | |
test_keys_stack_nested_leaf | 0.1248ms | 82.8618μs | 12.0683 KOps/s | 12.4356 KOps/s | |
test_keys_stack_nested_locked | 0.1667ms | 97.3393μs | 10.2733 KOps/s | 10.5722 KOps/s | |
test_values | 7.3352μs | 0.8545μs | 1.1703 MOps/s | 1.1740 MOps/s | |
test_values_nested | 82.3510μs | 38.0295μs | 26.2954 KOps/s | 27.0352 KOps/s | |
test_values_nested_locked | 90.0610μs | 39.6182μs | 25.2409 KOps/s | 25.6445 KOps/s | |
test_values_nested_leaf | 0.1360ms | 43.2385μs | 23.1275 KOps/s | 23.6916 KOps/s | |
test_values_stack_nested | 78.9210μs | 38.4229μs | 26.0262 KOps/s | 26.4051 KOps/s | |
test_values_stack_nested_leaf | 80.0310μs | 43.4958μs | 22.9907 KOps/s | 23.3023 KOps/s | |
test_values_stack_nested_locked | 82.1410μs | 39.8142μs | 25.1167 KOps/s | 25.1088 KOps/s | |
test_membership | 2.7376μs | 0.4973μs | 2.0110 MOps/s | 1.9326 MOps/s | |
test_membership_nested | 21.9205μs | 2.0026μs | 499.3459 KOps/s | 479.0933 KOps/s | |
test_membership_nested_leaf | 22.0355μs | 2.0056μs | 498.6090 KOps/s | 494.3086 KOps/s | |
test_membership_stacked_nested | 37.0400μs | 2.0919μs | 478.0346 KOps/s | 483.9580 KOps/s | |
test_membership_stacked_nested_leaf | 58.0410μs | 2.0082μs | 497.9622 KOps/s | 487.5546 KOps/s | |
test_membership_nested_last | 26.3810μs | 3.0627μs | 326.5058 KOps/s | 329.1918 KOps/s | |
test_membership_nested_leaf_last | 27.1100μs | 3.0662μs | 326.1338 KOps/s | 325.2062 KOps/s | |
test_membership_stacked_nested_last | 33.4710μs | 3.0431μs | 328.6117 KOps/s | 331.4722 KOps/s | |
test_membership_stacked_nested_leaf_last | 24.6600μs | 3.0468μs | 328.2183 KOps/s | 330.0383 KOps/s | |
test_nested_getleaf | 44.5300μs | 6.2095μs | 161.0437 KOps/s | 161.0808 KOps/s | |
test_nested_get | 1.6559ms | 5.8221μs | 171.7607 KOps/s | 168.4594 KOps/s | |
test_stacked_getleaf | 56.9300μs | 6.0884μs | 164.2456 KOps/s | 162.2377 KOps/s | |
test_stacked_get | 34.7400μs | 5.8137μs | 172.0079 KOps/s | 171.3535 KOps/s | |
test_nested_getitemleaf | 55.5510μs | 6.4396μs | 155.2901 KOps/s | 155.1282 KOps/s | |
test_nested_getitem | 34.8400μs | 6.0807μs | 164.4557 KOps/s | 164.9809 KOps/s | |
test_stacked_getitemleaf | 68.7310μs | 6.3699μs | 156.9896 KOps/s | 156.9167 KOps/s | |
test_stacked_getitem | 29.6300μs | 5.9817μs | 167.1767 KOps/s | 166.9024 KOps/s | |
test_lock_nested | 0.4237ms | 0.3389ms | 2.9505 KOps/s | 2.8404 KOps/s | |
test_lock_stack_nested | 0.4214ms | 0.3468ms | 2.8834 KOps/s | 2.8583 KOps/s | |
test_unlock_nested | 0.3502ms | 0.2797ms | 3.5755 KOps/s | 3.5278 KOps/s | |
test_unlock_stack_nested | 0.3361ms | 0.2851ms | 3.5080 KOps/s | 3.4808 KOps/s | |
test_flatten_speed | 0.1285ms | 77.9749μs | 12.8246 KOps/s | 12.9599 KOps/s | |
test_unflatten_speed | 0.3667ms | 0.3212ms | 3.1130 KOps/s | 3.1136 KOps/s | |
test_common_ops | 0.8392ms | 0.6316ms | 1.5833 KOps/s | 1.6306 KOps/s | |
test_creation | 70.0010μs | 1.7436μs | 573.5348 KOps/s | 574.7902 KOps/s | |
test_creation_empty | 50.1600μs | 9.6865μs | 103.2366 KOps/s | 115.3093 KOps/s | |
test_creation_nested_1 | 51.1110μs | 11.3003μs | 88.4932 KOps/s | 96.0127 KOps/s | |
test_creation_nested_2 | 44.6000μs | 14.0520μs | 71.1644 KOps/s | 76.4927 KOps/s | |
test_clone | 43.2400μs | 10.2183μs | 97.8639 KOps/s | 97.3933 KOps/s | |
test_getitem[int] | 1.3206ms | 10.7960μs | 92.6265 KOps/s | 90.8035 KOps/s | |
test_getitem[slice_int] | 0.1083ms | 20.9968μs | 47.6262 KOps/s | 47.0395 KOps/s | |
test_getitem[range] | 0.1287ms | 37.1120μs | 26.9455 KOps/s | 26.6729 KOps/s | |
test_getitem[tuple] | 0.1090ms | 18.5581μs | 53.8847 KOps/s | 54.0338 KOps/s | |
test_getitem[list] | 0.1263ms | 33.1613μs | 30.1556 KOps/s | 30.5293 KOps/s | |
test_setitem_dim[int] | 40.4710μs | 19.0471μs | 52.5014 KOps/s | 52.6412 KOps/s | |
test_setitem_dim[slice_int] | 58.7910μs | 37.6305μs | 26.5742 KOps/s | 26.5589 KOps/s | |
test_setitem_dim[range] | 0.1591ms | 52.2358μs | 19.1440 KOps/s | 19.2654 KOps/s | |
test_setitem_dim[tuple] | 52.9210μs | 31.7054μs | 31.5404 KOps/s | 31.3700 KOps/s | |
test_setitem | 75.7500μs | 15.4671μs | 64.6533 KOps/s | 66.7855 KOps/s | |
test_set | 0.1235ms | 15.1258μs | 66.1124 KOps/s | 70.2554 KOps/s | |
test_set_shared | 0.5090ms | 0.1571ms | 6.3661 KOps/s | 6.3558 KOps/s | |
test_update | 0.3224ms | 18.8233μs | 53.1257 KOps/s | 57.7135 KOps/s | |
test_update_nested | 71.2110μs | 24.0439μs | 41.5906 KOps/s | 42.8498 KOps/s | |
test_update__nested | 0.6033ms | 24.8112μs | 40.3043 KOps/s | 40.9215 KOps/s | |
test_set_nested | 67.1610μs | 16.0910μs | 62.1466 KOps/s | 63.7208 KOps/s | |
test_set_nested_new | 71.1300μs | 18.7361μs | 53.3729 KOps/s | 56.4585 KOps/s | |
test_select | 63.7310μs | 30.5158μs | 32.7699 KOps/s | 33.9063 KOps/s | |
test_select_nested | 75.3910μs | 43.1402μs | 23.1802 KOps/s | 22.9900 KOps/s | |
test_exclude_nested | 94.6410μs | 62.9662μs | 15.8815 KOps/s | 15.6921 KOps/s | |
test_empty[True] | 0.3385ms | 0.2970ms | 3.3674 KOps/s | 3.4001 KOps/s | |
test_empty[False] | 3.2940μs | 0.8181μs | 1.2223 MOps/s | 1.2115 MOps/s | |
test_to | 87.3810μs | 54.2156μs | 18.4449 KOps/s | 17.9588 KOps/s | |
test_to_nonblocking | 94.7810μs | 46.6619μs | 21.4308 KOps/s | 20.2114 KOps/s | |
test_unbind_speed | 0.3076ms | 0.2432ms | 4.1119 KOps/s | 4.1295 KOps/s | |
test_unbind_speed_stack0 | 0.3166ms | 0.2440ms | 4.0980 KOps/s | 4.1384 KOps/s | |
test_unbind_speed_stack1 | 99.1234ms | 0.7414ms | 1.3487 KOps/s | 1.3450 KOps/s | |
test_split | 98.0934ms | 1.6034ms | 623.6704 Ops/s | 616.9463 Ops/s | |
test_chunk | 99.8190ms | 1.6057ms | 622.7625 Ops/s | 617.9948 Ops/s | |
test_consolidate[False-None] | 0.1042s | 2.9630ms | 337.4934 Ops/s | 339.8813 Ops/s | |
test_consolidate[default-None] | 1.9008ms | 1.7317ms | 577.4714 Ops/s | 574.6990 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8440ms | 1.7680ms | 565.6051 Ops/s | 567.9887 Ops/s | |
test_consolidate_njt[False-None] | 7.0505ms | 6.6441ms | 150.5089 Ops/s | 110.0395 Ops/s | |
test_to[False-False-None] | 1.7988ms | 1.7094ms | 585.0036 Ops/s | 585.4604 Ops/s | |
test_to[True-False-None] | 1.6482ms | 1.3593ms | 735.6723 Ops/s | 720.0552 Ops/s | |
test_to[within-False-None] | 4.2668ms | 4.1650ms | 240.0978 Ops/s | 235.5305 Ops/s | |
test_to[True-default-None] | 5.5265ms | 5.3249ms | 187.7976 Ops/s | 178.2465 Ops/s | |
test_to_njt[False-False-None] | 7.0547ms | 6.9238ms | 144.4297 Ops/s | 140.0525 Ops/s | |
test_to_njt[True-False-None] | 5.7268ms | 5.5385ms | 180.5544 Ops/s | 170.4956 Ops/s | |
test_to_njt[within-False-None] | 12.3925ms | 12.2208ms | 81.8278 Ops/s | 78.3850 Ops/s | |
test_creation[device0] | 0.5686ms | 78.4960μs | 12.7395 KOps/s | 12.0224 KOps/s | |
test_creation_from_tensor | 0.4647ms | 82.7304μs | 12.0875 KOps/s | 11.9548 KOps/s | |
test_add_one[memmap_tensor0] | 0.6762ms | 6.4945μs | 153.9762 KOps/s | 154.8223 KOps/s | |
test_contiguous[memmap_tensor0] | 1.6620μs | 0.4209μs | 2.3759 MOps/s | 2.3750 MOps/s | |
test_stack[memmap_tensor0] | 36.7010μs | 4.6563μs | 214.7622 KOps/s | 217.6906 KOps/s | |
test_memmaptd_index | 1.8166ms | 0.2428ms | 4.1188 KOps/s | 4.0221 KOps/s | |
test_memmaptd_index_astensor | 0.4287ms | 0.3050ms | 3.2789 KOps/s | 3.2310 KOps/s | |
test_memmaptd_index_op | 0.7189ms | 0.5886ms | 1.6989 KOps/s | 1.7121 KOps/s | |
test_serialize_model | 0.1325s | 0.1307s | 7.6534 Ops/s | 7.6845 Ops/s | |
test_serialize_model_pickle | 1.3492s | 1.1918s | 0.8391 Ops/s | 0.8215 Ops/s | |
test_serialize_weights | 0.1297s | 0.1292s | 7.7411 Ops/s | 7.7370 Ops/s | |
test_serialize_weights_returnearly | 0.3266s | 54.2582ms | 18.4304 Ops/s | 14.6628 Ops/s | |
test_serialize_weights_pickle | 1.3765s | 1.1906s | 0.8399 Ops/s | 0.8198 Ops/s | |
test_reshape_pytree | 51.1810μs | 22.1983μs | 45.0485 KOps/s | 44.5821 KOps/s | |
test_reshape_td | 54.6700μs | 26.8791μs | 37.2036 KOps/s | 37.3450 KOps/s | |
test_view_pytree | 54.1100μs | 21.7583μs | 45.9594 KOps/s | 44.9765 KOps/s | |
test_view_td | 61.2600μs | 32.0235μs | 31.2271 KOps/s | 31.8937 KOps/s | |
test_unbind_pytree | 58.9400μs | 27.8717μs | 35.8787 KOps/s | 35.2931 KOps/s | |
test_unbind_td | 0.8316ms | 37.3163μs | 26.7979 KOps/s | 26.3161 KOps/s | |
test_split_pytree | 61.9400μs | 29.9649μs | 33.3724 KOps/s | 32.6404 KOps/s | |
test_split_td | 0.1813ms | 39.0756μs | 25.5914 KOps/s | 25.2702 KOps/s | |
test_add_pytree | 0.1085ms | 33.1884μs | 30.1310 KOps/s | 29.2538 KOps/s | |
test_add_td | 88.0210μs | 49.4474μs | 20.2235 KOps/s | 21.2002 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1831ms | 0.1220ms | 8.1939 KOps/s | 7.8456 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2310ms | 0.1325ms | 7.5462 KOps/s | 7.4180 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1328ms | 96.2707μs | 10.3874 KOps/s | 10.4433 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.3123ms | 0.1459ms | 6.8517 KOps/s | 6.6849 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 90.9910μs | 29.2345μs | 34.2062 KOps/s | 41.7629 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1117ms | 29.0740μs | 34.3950 KOps/s | 33.6405 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.4307ms | 64.2771μs | 15.5576 KOps/s | 15.4645 KOps/s | |
test_compile_copy_nested[pytree-eager] | 83.1210μs | 49.0274μs | 20.3968 KOps/s | 20.2796 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1968ms | 0.1423ms | 7.0277 KOps/s | 6.9351 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3174ms | 0.2182ms | 4.5838 KOps/s | 4.5969 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1487ms | 0.1019ms | 9.8104 KOps/s | 9.7650 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1868ms | 55.1265μs | 18.1401 KOps/s | 17.2514 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2067ms | 0.1381ms | 7.2432 KOps/s | 7.3594 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5608ms | 0.4744ms | 2.1079 KOps/s | 2.0659 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3970ms | 0.2618ms | 3.8198 KOps/s | 3.8188 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2085ms | 0.1467ms | 6.8175 KOps/s | 7.0213 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.3169ms | 67.5726μs | 14.7989 KOps/s | 14.4501 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1427ms | 98.5445μs | 10.1477 KOps/s | 9.7941 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4411ms | 0.4000ms | 2.4999 KOps/s | 2.4955 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1815ms | 0.1347ms | 7.4251 KOps/s | 7.2573 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 58.4010μs | 19.1154μs | 52.3139 KOps/s | 52.8000 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 61.1810μs | 31.8149μs | 31.4318 KOps/s | 32.0451 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1049ms | 70.4893μs | 14.1866 KOps/s | 14.2393 KOps/s | |
test_compile_copy_flat[pytree-eager] | 75.2610μs | 52.4414μs | 19.0689 KOps/s | 19.2869 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6468ms | 0.3968ms | 2.5199 KOps/s | 2.1831 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.8461ms | 2.6630ms | 375.5115 Ops/s | 379.0570 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6463ms | 0.4432ms | 2.2561 KOps/s | 2.1912 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.8019ms | 2.5853ms | 386.8026 Ops/s | 383.2760 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.6024ms | 0.1149ms | 8.7024 KOps/s | 8.4921 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5723ms | 78.3637μs | 12.7610 KOps/s | 11.9512 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.5404ms | 0.1107ms | 9.0348 KOps/s | 8.8933 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1280ms | 68.6892μs | 14.5583 KOps/s | 13.7281 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1702ms | 0.1125ms | 8.8862 KOps/s | 8.9444 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1190ms | 69.3842μs | 14.4125 KOps/s | 14.0474 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1668ms | 0.1038ms | 9.6377 KOps/s | 9.7619 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1455ms | 17.8264μs | 56.0966 KOps/s | 41.0155 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1439ms | 94.9702μs | 10.5296 KOps/s | 10.1098 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 53.9800μs | 15.7298μs | 63.5736 KOps/s | 62.8700 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.2075ms | 97.4777μs | 10.2588 KOps/s | 10.2674 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 53.0500μs | 15.7474μs | 63.5027 KOps/s | 63.0940 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2328ms | 0.1033ms | 9.6829 KOps/s | 9.8698 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.6231ms | 17.1078μs | 58.4530 KOps/s | 53.2502 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1454ms | 96.4676μs | 10.3662 KOps/s | 9.9652 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 56.4510μs | 16.3072μs | 61.3225 KOps/s | 62.9382 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1465ms | 95.5921μs | 10.4611 KOps/s | 10.3645 KOps/s | |
test_compile_indexing[int-pytree-eager] | 50.6900μs | 15.7573μs | 63.4626 KOps/s | 63.0276 KOps/s | |
test_mod_add[eager] | 89.9710μs | 39.9762μs | 25.0149 KOps/s | 25.9262 KOps/s | |
test_mod_add[compile] | 0.3747ms | 82.3264μs | 12.1468 KOps/s | 11.9209 KOps/s | |
test_mod_add[compile-overhead] | 0.3304ms | 0.1695ms | 5.8997 KOps/s | 5.6851 KOps/s | |
test_mod_wrap[eager] | 0.3289ms | 0.2461ms | 4.0631 KOps/s | 3.9898 KOps/s | |
test_mod_wrap[compile] | 0.4081ms | 0.2867ms | 3.4880 KOps/s | 3.4486 KOps/s | |
test_mod_wrap[compile-overhead] | 7.2074ms | 3.8668ms | 258.6118 Ops/s | 258.3991 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.4809ms | 1.3298ms | 752.0208 Ops/s | 702.1843 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.4198ms | 1.2718ms | 786.2841 Ops/s | 725.1513 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3618ms | 0.9192ms | 1.0879 KOps/s | 933.3138 Ops/s | |
test_seq_add[eager] | 0.1945ms | 0.1180ms | 8.4722 KOps/s | 8.1016 KOps/s | |
test_seq_add[compile] | 0.2094ms | 90.5748μs | 11.0406 KOps/s | 10.7896 KOps/s | |
test_seq_add[compile-overhead] | 0.2405ms | 0.1286ms | 7.7758 KOps/s | 7.7171 KOps/s | |
test_seq_wrap[eager] | 0.4997ms | 0.4215ms | 2.3722 KOps/s | 2.3567 KOps/s | |
test_seq_wrap[compile] | 0.3705ms | 0.3014ms | 3.3175 KOps/s | 3.2847 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2851ms | 0.2352ms | 4.2512 KOps/s | 4.3912 KOps/s | |
test_func_call_runtime[False-eager] | 0.7900ms | 0.7222ms | 1.3846 KOps/s | 1.3626 KOps/s | |
test_func_call_runtime[False-compile] | 0.8468ms | 0.7540ms | 1.3262 KOps/s | 1.3160 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4187ms | 0.3669ms | 2.7255 KOps/s | 2.7438 KOps/s | |
test_func_call_runtime[True-eager] | 0.9646ms | 0.8852ms | 1.1296 KOps/s | 1.1148 KOps/s | |
test_func_call_runtime[True-compile] | 0.8196ms | 0.7751ms | 1.2901 KOps/s | 1.2710 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5609ms | 0.3908ms | 2.5588 KOps/s | 2.6100 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.7734ms | 0.7180ms | 1.3927 KOps/s | 1.3309 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8027ms | 0.7557ms | 1.3233 KOps/s | 1.2631 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4634ms | 0.3710ms | 2.6954 KOps/s | 2.7280 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1637ms | 0.9871ms | 1.0131 KOps/s | 991.1955 Ops/s | |
test_func_call_cm_runtime[True-compile] | 1.0954ms | 0.9794ms | 1.0210 KOps/s | 1.0183 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.0402ms | 0.9747ms | 1.0260 KOps/s | 1.0068 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.4283ms | 2.0192ms | 495.2370 Ops/s | 473.3457 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9074ms | 0.8169ms | 1.2241 KOps/s | 1.2042 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4652ms | 0.4178ms | 2.3937 KOps/s | 2.3760 KOps/s | |
test_distributed | 3.0257ms | 0.1339ms | 7.4679 KOps/s | 8.6555 KOps/s | |
test_tdmodule | 34.2810μs | 20.7789μs | 48.1258 KOps/s | 48.0399 KOps/s | |
test_tdmodule_dispatch | 68.0600μs | 37.3637μs | 26.7640 KOps/s | 27.3776 KOps/s | |
test_tdseq | 42.2800μs | 21.7161μs | 46.0487 KOps/s | 47.5549 KOps/s | |
test_tdseq_dispatch | 64.6910μs | 40.7222μs | 24.5566 KOps/s | 25.5880 KOps/s | |
test_instantiation_functorch | 1.7330ms | 1.5552ms | 643.0157 Ops/s | 646.8802 Ops/s | |
test_exec_functorch | 0.1810ms | 0.1406ms | 7.1129 KOps/s | 7.1785 KOps/s | |
test_exec_functional_call | 0.1948ms | 0.1311ms | 7.6251 KOps/s | 7.6282 KOps/s | |
test_exec_td_decorator | 0.3645ms | 0.1824ms | 5.4830 KOps/s | 5.4750 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.7498ms | 0.6695ms | 1.4936 KOps/s | 1.4815 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8345ms | 0.6709ms | 1.4905 KOps/s | 1.4800 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7046ms | 0.5766ms | 1.7343 KOps/s | 1.6889 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7045ms | 0.5806ms | 1.7223 KOps/s | 1.6940 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.4655ms | 18.7019ms | 53.4706 Ops/s | 52.7047 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.5013ms | 18.7693ms | 53.2784 Ops/s | 52.6247 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.4721ms | 18.8569ms | 53.0310 Ops/s | 53.6276 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.5668ms | 18.6320ms | 53.6712 Ops/s | 53.6836 Ops/s | |
test_to_module_speed[True] | 1.0483ms | 0.9625ms | 1.0389 KOps/s | 1.0340 KOps/s | |
test_to_module_speed[False] | 1.0456ms | 0.9555ms | 1.0465 KOps/s | 1.0433 KOps/s | |
test_tc_init | 84.5000μs | 37.7325μs | 26.5023 KOps/s | 27.7671 KOps/s | |
test_tc_init_nested | 0.1184ms | 77.5276μs | 12.8986 KOps/s | 13.7909 KOps/s | |
test_tc_first_layer_tensor | 23.6600μs | 0.7839μs | 1.2757 MOps/s | 1.2473 MOps/s | |
test_tc_first_layer_nontensor | 23.1900μs | 2.2157μs | 451.3192 KOps/s | 447.6502 KOps/s | |
test_tc_second_layer_tensor | 7.6050μs | 1.4145μs | 706.9622 KOps/s | 715.1335 KOps/s | |
test_tc_second_layer_nontensor | 41.6900μs | 2.9041μs | 344.3354 KOps/s | 336.7748 KOps/s | |
test_unbind | 0.2204s | 10.0904ms | 99.1043 Ops/s | 144.0928 Ops/s | |
test_full_like | 11.0551ms | 9.8805ms | 101.2094 Ops/s | 100.8884 Ops/s | |
test_zeros_like | 5.0804ms | 4.4425ms | 225.0965 Ops/s | 224.9536 Ops/s | |
test_ones_like | 5.0515ms | 4.4595ms | 224.2403 Ops/s | 222.0583 Ops/s | |
test_clone | 12.7580ms | 9.6533ms | 103.5912 Ops/s | 143.9361 Ops/s | |
test_squeeze | 60.7110μs | 9.9005μs | 101.0048 KOps/s | 95.0103 KOps/s | |
test_unsqueeze | 0.1617ms | 73.5280μs | 13.6003 KOps/s | 13.2140 KOps/s | |
test_split | 0.3584ms | 0.1587ms | 6.3016 KOps/s | 5.9974 KOps/s | |
test_permute | 0.2305ms | 0.1835ms | 5.4490 KOps/s | 5.3809 KOps/s | |
test_stack | 53.0815ms | 51.7160ms | 19.3364 Ops/s | 19.5112 Ops/s | |
test_cat | 53.1732ms | 51.6233ms | 19.3711 Ops/s | 19.4805 Ops/s |
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Stack from ghstack (oldest at bottom):