Jason Volk
|
d0cbf6c14c
|
ircd::simt: Add hardware ident access register (AMDDNA)
|
2023-01-02 20:25:58 +00:00 |
|
Jason Volk
|
a158c214a9
|
ircd::simt: Add hardware ident access register (AMDGCN).
|
2023-01-02 19:42:21 +00:00 |
|
Jason Volk
|
2b5f5f7a95
|
ircd::simt: Add preliminary memory characteristics tool.
|
2023-01-02 19:42:21 +00:00 |
|
Jason Volk
|
38c4959f6c
|
ircd::simt: Add macro to differentiate RDNA over GCN.
|
2023-01-02 19:42:21 +00:00 |
|
Jason Volk
|
d377674748
|
ircd::simt: Split vector reduce_add to hadd.
|
2023-01-01 19:16:06 -08:00 |
|
Jason Volk
|
1f87668a28
|
ircd::simt: Add cycles_rtc() intrinsic for s_memrealtime on (AMDGCN).
|
2023-01-01 21:28:54 +00:00 |
|
Jason Volk
|
47e54547f9
|
ircd::simt: Add optimizations from cl_khr_subgroups support.
|
2023-01-01 21:28:54 +00:00 |
|
Jason Volk
|
0fb7577066
|
ircd::simt: Split is_pow() util to math header for now.
|
2023-01-01 21:28:54 +00:00 |
|
Jason Volk
|
b068b7d959
|
ircd::simt: Add portables for static_assert macro.
|
2022-11-07 01:51:19 +00:00 |
|
Jason Volk
|
561be9973a
|
ircd::simt::norm: Barrier for overlapping input and output buffers. (gfx1011)
|
2022-11-03 18:28:53 +00:00 |
|
Jason Volk
|
7c8163fcab
|
ircd::simt: Alias asm to OpenCL __asm__.
|
2022-11-03 18:28:53 +00:00 |
|
Jason Volk
|
ac163274f1
|
ircd::simt: Simplify keywording; tease out OpenCL version by target.
|
2022-10-30 18:44:50 +00:00 |
|
Jason Volk
|
a50f9fbd1b
|
ircd::simt: Improve assert related; fix 83114172 argument name.
|
2022-10-18 22:01:35 +00:00 |
|
Jason Volk
|
6a05fcefeb
|
ircd::simt: Consolidate timestamp counter sampling into inline.
|
2022-10-12 18:53:53 -07:00 |
|
Jason Volk
|
54e3b8f5b4
|
ircd::simt: Consolidate portables and macros from units into headers.
|
2022-10-11 02:13:46 +00:00 |
|
Jason Volk
|
c0f0398b45
|
ircd::simt: Add inline to clear the mean state to zero.
|
2022-10-05 13:53:52 -07:00 |
|
Jason Volk
|
31e078506a
|
ircd::simt: Store divisor in mean state for now.
|
2022-06-19 20:14:22 -07:00 |
|
Jason Volk
|
d817536e23
|
ircd::simt: Use defined epsilon for normalization smoothing for now.
|
2022-06-17 21:11:53 -07:00 |
|
Jason Volk
|
b3dc9a9ac3
|
ircd::simt: Fixes for non-pow2 workgroup sizes.
|
2022-06-17 21:11:53 -07:00 |
|
Jason Volk
|
2b30d775b9
|
ircd::simt: Add intrinsic integer max.
|
2022-06-17 21:11:53 -07:00 |
|
Jason Volk
|
55004f054a
|
ircd::simt: Use native_sqrt(); minor cleanup / named statement.
|
2022-06-17 21:11:53 -07:00 |
|
Jason Volk
|
d733fe8ef9
|
ircd::simt: Add overloads for scalar types for now.
|
2022-06-17 21:11:53 -07:00 |
|
Jason Volk
|
5e824a4f6f
|
ircd::simt: Simplify LDS broadcast.
|
2022-06-17 21:11:53 -07:00 |
|
Jason Volk
|
12f3eb871d
|
ircd::simt: Fix buffer and ordering hokey-pokey.
|
2022-06-17 21:11:53 -07:00 |
|
Jason Volk
|
38f6507434
|
ircd::simt: Relax define guards for stdclc++ compat.
|
2022-06-17 21:11:52 -07:00 |
|
Jason Volk
|
a85f192066
|
ircd::simt: Move workgroup ident values to arguments.
|
2022-06-17 21:11:52 -07:00 |
|
Jason Volk
|
422c4da624
|
ircd::simt: Add a local variant for prng.
|
2021-10-06 13:13:47 -07:00 |
|
Jason Volk
|
f1051cf56b
|
ircd::simt: Implement bitonic sort; replace stub.
|
2021-09-15 01:44:36 -07:00 |
|
Jason Volk
|
3e9c2d1b56
|
ircd::simt: Abstract the three-piece softmax, mean state related.
|
2021-05-11 20:56:09 -07:00 |
|
Jason Volk
|
5e91d51e6a
|
ircd::simt: Add dressings for cross-inclusion.
|
2021-05-11 20:56:09 -07:00 |
|
Jason Volk
|
bc98835b3c
|
ircd::simt: Simplify interface; internalize workitem functions.
|
2021-04-22 12:27:57 -07:00 |
|
Jason Volk
|
075b40400a
|
ircd::simt: Split reduce tools; add float4 horizontal add.
|
2021-04-22 12:27:57 -07:00 |
|
Jason Volk
|
de166dfab8
|
ircd::simt: Simplify various loop/mask conditionals; minor cleanup.
|
2021-04-22 12:27:57 -07:00 |
|
Jason Volk
|
9c062d9c3f
|
ircd::gpt::pipe: Correctness; compute loss, statistics; pipeline optimize.
|
2021-04-22 12:27:57 -07:00 |
|
Jason Volk
|
5962a58c1e
|
ircd::simt: Start a CLC toolchest.
|
2021-04-22 12:27:56 -07:00 |
|