Commit e61b839
mm: sync the last stream in the queue, not the next
Currently this peeks ahead to sync the next stream in the queue of
streams with the compute stream. This doesn't allow much
parallelization, as the end result is that you can only get one weight
load ahead regardless of how many streams you have.

Rotate the loop logic here to synchronize the end of the queue before
returning the next stream. This allows weights to be loaded ahead of the
compute stream's position.

1 parent f2930c1 commit e61b839
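The effect of the reordering can be illustrated with a small simulation. This is only a sketch of one reading of the message above, not the code from this commit: streams are plain integer indices, a "sync" is just recorded as the call number at which it happened, and `handout_lags` is a hypothetical helper name. It reports how many calls passed between a stream's latest sync with the compute stream and the moment it is handed out again; a larger gap means the load issued on that stream waits on older compute work and can therefore run further ahead.

```python
def handout_lags(num_streams, calls, rotated):
    """Simulate a round-robin queue of stream indices (no real GPU streams).

    Returns one entry per call: the number of calls since the handed-out
    stream was last synced, or None if it has not been synced yet.
    """
    counter = 0
    last_sync = {}  # stream index -> call number of its latest sync
    lags = []
    for call in range(calls):
        if rotated:
            # Assumed new ordering: sync the stream at the current position,
            # then advance and hand out the next stream.
            last_sync[counter] = call
            counter = (counter + 1) % num_streams
            handed_out = counter
            synced_at = last_sync.get(handed_out)
        else:
            # Assumed old ordering: hand out the current stream, then peek
            # ahead and sync the next stream in the queue.
            handed_out = counter
            synced_at = last_sync.get(handed_out)
            counter = (counter + 1) % num_streams
            last_sync[counter] = call
        lags.append(None if synced_at is None else call - synced_at)
    return lags


if __name__ == "__main__":
    print(handout_lags(4, 8, rotated=False))  # [None, 1, 1, 1, 1, 1, 1, 1]
    print(handout_lags(4, 8, rotated=True))   # [None, None, None, 3, 3, 3, 3, 3]
```

Under these assumptions the peek-ahead ordering always hands out a stream that was synced one call earlier, however many streams exist, while the rotated ordering hands out a stream synced `num_streams - 1` calls earlier.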
1 file changed, +3 -5 lines changed
Removed original lines 1034, 1035, 1038, 1045 and 1054; added new lines 1034, 1036 and 1038 (line contents missing).