Jason Volk
|
ce9abfb321
|
ircd::gpt::model: Optimize left-attention mask.
|
2021-10-06 13:13:47 -07:00 |
|
Jason Volk
|
9ca95591ff
|
ircd::gpt::model: Improve cache open mode related.
|
2021-09-15 02:37:45 -07:00 |
|
Jason Volk
|
3859a13293
|
ircd::gpt::model: Add conf items for model/cache opts related.
|
2021-09-14 23:39:55 -07:00 |
|
Jason Volk
|
2a3c54afa2
|
ircd::gpt::pipe: Add model invalidation for device resynchronization.
|
2021-04-22 12:27:57 -07:00 |
|
Jason Volk
|
d5eb1e3a87
|
ircd::gpt: Add backpropagation pipe.
|
2021-04-22 12:27:57 -07:00 |
|
Jason Volk
|
9c062d9c3f
|
ircd::gpt::pipe: Correctness; compute loss, statistics; pipeline optimize.
|
2021-04-22 12:27:57 -07:00 |
|
Jason Volk
|
1870c364f4
|
ircd::gpt: Reorganize interface.
|
2021-04-22 12:27:57 -07:00 |
|
Jason Volk
|
29fb7910b7
|
ircd::gpt: Model structural tweaks; task structure; various.
|
2021-04-22 12:27:56 -07:00 |
|
Jason Volk
|
4da7d2ae43
|
ircd::gpt: Add task struct; mmap cached model directly; improve init.
|
2021-03-16 12:58:22 -07:00 |
|
Jason Volk
|
4458235dfa
|
ircd::gpt: More Matrix Than Matrix.
|
2021-03-09 04:50:19 -08:00 |
|