Jason Volk
|
2a3c54afa2
|
ircd::gpt::pipe: Add model invalidation for device resynchronization.
|
2021-04-22 12:27:57 -07:00 |
|
Jason Volk
|
bd460a80ce
|
ircd::gpt::pipe::model: Additional sub-buffering for lmhead.
|
2021-04-22 12:27:57 -07:00 |
|
Jason Volk
|
b76d1f32ee
|
ircd::gpt::pipe: Improve self attention; various.
|
2021-04-22 12:27:57 -07:00 |
|
Jason Volk
|
d5eb1e3a87
|
ircd::gpt: Add backpropagation pipe.
|
2021-04-22 12:27:57 -07:00 |
|
Jason Volk
|
14a1561cad
|
ircd::gpt: Add adaptive moment state to pipe model; move task prelim defs.
|
2021-04-22 12:27:57 -07:00 |
|
Jason Volk
|
eba5d7f586
|
ircd::gpt: Improve build related; debug/assertions.
|
2021-04-22 12:27:57 -07:00 |
|
Jason Volk
|
9c062d9c3f
|
ircd::gpt::pipe: Correctness; compute loss, statistics; pipeline optimize.
|
2021-04-22 12:27:57 -07:00 |
|
Jason Volk
|
1870c364f4
|
ircd::gpt: Reorganize interface.
|
2021-04-22 12:27:57 -07:00 |
|
Jason Volk
|
29e74ec9e1
|
ircd::gpt: Transformer Pipe.
|
2021-04-22 12:27:56 -07:00 |
|