Class GPTModel
java.lang.Object
io.github.kirstenali.deepj.models.gpt.GPTModel
- All Implemented Interfaces:
Persistable,Trainable
Minimal GPT-style decoder-only transformer for educational/training use.
CPU-only and dependency-free; intended for small models.
-
Constructor Details
-
GPTModel
-
-
Method Details
-
forward
-
backward
-
parameters
- Specified by:
parametersin interfacePersistable- Specified by:
parametersin interfaceTrainable
-
gradClipNorm
public double gradClipNorm()
-