0
0
Fork 0
mirror of https://github.com/matrix-construct/construct synced 2025-01-25 05:49:58 +01:00
Commit graph

34 commits

Author SHA1 Message Date
Jason Volk
a158c214a9 ircd::simt: Add hardware ident access register (AMDGCN). 2023-01-02 19:42:21 +00:00
Jason Volk
2b5f5f7a95 ircd::simt: Add preliminary memory characteristics tool. 2023-01-02 19:42:21 +00:00
Jason Volk
38c4959f6c ircd::simt: Add macro to differentiate RDNA over GCN. 2023-01-02 19:42:21 +00:00
Jason Volk
d377674748 ircd::simt: Split vector reduce_add to hadd. 2023-01-01 19:16:06 -08:00
Jason Volk
1f87668a28 ircd::simt: Add cycles_rtc() intrinsic for s_memrealtime on (AMDGCN). 2023-01-01 21:28:54 +00:00
Jason Volk
47e54547f9 ircd::simt: Add optimizations from cl_khr_subgroups support. 2023-01-01 21:28:54 +00:00
Jason Volk
0fb7577066 ircd::simt: Split is_pow() util to math header for now. 2023-01-01 21:28:54 +00:00
Jason Volk
b068b7d959 ircd::simt: Add portables for static_assert macro. 2022-11-07 01:51:19 +00:00
Jason Volk
561be9973a ircd::simt::norm: Barrier for overlapping input and output buffers. (gfx1011) 2022-11-03 18:28:53 +00:00
Jason Volk
7c8163fcab ircd::simt: Alias asm to OpenCL __asm__. 2022-11-03 18:28:53 +00:00
Jason Volk
ac163274f1 ircd::simt: Simplify keywording; tease out OpenCL version by target. 2022-10-30 18:44:50 +00:00
Jason Volk
a50f9fbd1b ircd::simt: Improve assert related; fix 83114172 argument name. 2022-10-18 22:01:35 +00:00
Jason Volk
6a05fcefeb ircd::simt: Consolidate timestamp counter sampling into inline. 2022-10-12 18:53:53 -07:00
Jason Volk
54e3b8f5b4 ircd::simt: Consolidate portables and macros from units into headers. 2022-10-11 02:13:46 +00:00
Jason Volk
c0f0398b45 ircd::simt: Add inline to clear the mean state to zero. 2022-10-05 13:53:52 -07:00
Jason Volk
31e078506a ircd::simt: Store divisor in mean state for now. 2022-06-19 20:14:22 -07:00
Jason Volk
d817536e23 ircd::simt: Use defined epsilon for normalization smoothing for now. 2022-06-17 21:11:53 -07:00
Jason Volk
b3dc9a9ac3 ircd::simt: Fixes for non-pow2 workgroup sizes. 2022-06-17 21:11:53 -07:00
Jason Volk
2b30d775b9 ircd::simt: Add intrinsic integer max. 2022-06-17 21:11:53 -07:00
Jason Volk
55004f054a ircd::simt: Use native_sqrt(); minor cleanup / named statement. 2022-06-17 21:11:53 -07:00
Jason Volk
d733fe8ef9 ircd::simt: Add overloads for scalar types for now. 2022-06-17 21:11:53 -07:00
Jason Volk
5e824a4f6f ircd::simt: Simplify LDS broadcast. 2022-06-17 21:11:53 -07:00
Jason Volk
12f3eb871d ircd::simt: Fix buffer and ordering hokey-pokey. 2022-06-17 21:11:53 -07:00
Jason Volk
38f6507434 ircd::simt: Relax define guards for stdclc++ compat. 2022-06-17 21:11:52 -07:00
Jason Volk
a85f192066 ircd::simt: Move workgroup ident values to arguments. 2022-06-17 21:11:52 -07:00
Jason Volk
422c4da624 ircd::simt: Add a local variant for prng. 2021-10-06 13:13:47 -07:00
Jason Volk
f1051cf56b ircd::simt: Implement bitonic sort; replace stub. 2021-09-15 01:44:36 -07:00
Jason Volk
3e9c2d1b56 ircd::simt: Abstract the three-piece softmax, mean state related. 2021-05-11 20:56:09 -07:00
Jason Volk
5e91d51e6a ircd::simt: Add dressings for cross-inclusion. 2021-05-11 20:56:09 -07:00
Jason Volk
bc98835b3c ircd::simt: Simplify interface; internalize workitem functions. 2021-04-22 12:27:57 -07:00
Jason Volk
075b40400a ircd::simt: Split reduce tools; add float4 horizontal add. 2021-04-22 12:27:57 -07:00
Jason Volk
de166dfab8 ircd::simt: Simplify various loop/mask conditionals; minor cleanup. 2021-04-22 12:27:57 -07:00
Jason Volk
9c062d9c3f ircd::gpt::pipe: Correctness; compute loss, statistics; pipeline optimize. 2021-04-22 12:27:57 -07:00
Jason Volk
5962a58c1e ircd::simt: Start a CLC toolchest. 2021-04-22 12:27:56 -07:00