Author | Commit | Message | Date
Jason Volk | bd460a80ce | ircd::gpt::pipe::model: Additional sub-buffering for lmhead. | 2021-04-22 12:27:57 -07:00
Jason Volk | d5eb1e3a87 | ircd::gpt: Add backpropagation pipe. | 2021-04-22 12:27:57 -07:00
Jason Volk | 14a1561cad | ircd::gpt: Add adaptive moment state to pipe model; move task prelim defs. | 2021-04-22 12:27:57 -07:00
Jason Volk | eba5d7f586 | ircd::gpt: Improve build related; debug/assertions. | 2021-04-22 12:27:57 -07:00
Jason Volk | 9c062d9c3f | ircd::gpt::pipe: Correctness; compute loss, statistics; pipeline optimize. | 2021-04-22 12:27:57 -07:00
Jason Volk | 1870c364f4 | ircd::gpt: Reorganize interface. | 2021-04-22 12:27:57 -07:00
Jason Volk | 29e74ec9e1 | ircd::gpt: Transformer Pipe. | 2021-04-22 12:27:56 -07:00
Jason Volk | 29fb7910b7 | ircd::gpt: Model structural tweaks; task structure; various. | 2021-04-22 12:27:56 -07:00
Jason Volk | 4da7d2ae43 | ircd::gpt: Add task struct; mmap cached model directly; improve init. | 2021-03-16 12:58:22 -07:00
Jason Volk | 734948863f | ircd::gpt::vocab: Add token debug string tool. | 2021-03-09 04:50:19 -08:00
Jason Volk | 33a1ffd4bf | ircd::gpt: Add basic interface; add options, context. | 2021-03-09 04:50:19 -08:00
Jason Volk | 4458235dfa | ircd::gpt: More Matrix Than Matrix. | 2021-03-09 04:50:19 -08:00
Jason Volk | 53c4260a21 | ircd::gpt: Add Basic Latin (lower) and C0 replacement LUT; various. | 2021-03-09 04:50:19 -08:00
Jason Volk | 29b99dcf4d | ircd::gpt: Split vocab related into separate unit. | 2021-03-02 11:13:59 -08:00
Jason Volk | 57f9d3fdfb | ircd::gpt: Add vocabulary tokenization; byte-pair merge encoding for natural language. | 2021-02-26 14:52:33 -08:00