Class TextDataset

java.lang.Object
io.github.kirstenali.deepj.data.TextDataset

public final class TextDataset extends Object
Simple in-memory dataset that samples random contiguous chunks from a sequence of token ids. The implementation is intentionally minimal but correct; for large corpora, replace it with a memory-mapped or streaming implementation.
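
To illustrate what sampling random contiguous chunks from token ids can look like, here is a minimal sketch. It is not the actual TextDataset source or API: the class name ChunkSamplerSketch, the constructor parameters (tokens, seqLen, seed), and the nextExample method are all hypothetical. It also assumes a next-token training setup in which each sampled chunk is paired with a target shifted by one position, which the description above does not state.

```java
import java.util.Arrays;
import java.util.Random;

/** Hypothetical sketch of random contiguous chunk sampling; not the real TextDataset. */
final class ChunkSamplerSketch {
    private final int[] tokens;   // the whole corpus, held in memory as token ids
    private final int seqLen;     // length of each sampled chunk
    private final Random rng;

    ChunkSamplerSketch(int[] tokens, int seqLen, long seed) {
        // One extra token is needed so the shifted target never runs off the end.
        if (seqLen <= 0 || tokens.length < seqLen + 1) {
            throw new IllegalArgumentException("need at least seqLen + 1 tokens");
        }
        this.tokens = tokens.clone();
        this.seqLen = seqLen;
        this.rng = new Random(seed);
    }

    /**
     * Returns {input, target}. Both are contiguous slices of the corpus;
     * the target is the input window shifted one token to the right
     * (an assumption of this sketch).
     */
    int[][] nextExample() {
        // Valid start offsets are 0 .. tokens.length - seqLen - 1 inclusive.
        int start = rng.nextInt(tokens.length - seqLen);
        int[] input = Arrays.copyOfRange(tokens, start, start + seqLen);
        int[] target = Arrays.copyOfRange(tokens, start + 1, start + seqLen + 1);
        return new int[][] { input, target };
    }
}
```

A caller would build the sampler once, for example new ChunkSamplerSketch(ids, 128, 42L), and call nextExample() repeatedly to assemble a batch. Because the whole array lives on the heap, this approach only suits corpora that fit in memory, which is why a memory-mapped or streaming variant is suggested for larger data.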