In get_batch, the random starting index is sampled as:
ix = torch.randint(len(data) - block_size, (batch_size,))
But y is sliced as:
y = torch.stack([data[i + 1: i + block_size + 1] for i in ix])
When i takes its maximum value, len(data) - block_size - 1 (torch.randint's upper bound is exclusive), y spans data[len(data) - block_size : len(data)]. The slice end of len(data) is an exclusive bound, so the last element read is at index len(data) - 1 and y is never clipped short.
In other words, the sampling range is already safe: every sampled chunk has room for both x and y. Tightening the bound to len(data) - block_size - 1 would not fix anything; it would only discard the last valid starting position.
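A quick sanity check of the boundary case, sketched with a plain Python list standing in for the token tensor (list slicing clips past-the-end bounds the same way tensor slicing does, so the behaviour at the boundary is identical). The dataset size of 100 is an arbitrary illustrative choice:

```python
# Toy stand-in for the token tensor; slicing semantics match tensors.
data = list(range(100))   # hypothetical 100-token dataset
block_size = 8

# torch.randint(high, ...) samples from [0, high), so the largest
# starting index get_batch can draw is len(data) - block_size - 1.
max_i = len(data) - block_size - 1

x = data[max_i : max_i + block_size]
y = data[max_i + 1 : max_i + block_size + 1]

# y's slice ends exactly at len(data): an exclusive bound, so the last
# element read is index len(data) - 1 and nothing is clipped.
print(len(x), len(y))      # -> 8 8
print(y[-1] == data[-1])   # -> True: y's last token is the final token
```

Running this confirms that even at the maximum starting index, both x and y contain a full block_size tokens.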