0
0
Fork 0
mirror of https://github.com/matrix-construct/construct synced 2024-11-14 05:51:10 +01:00
Commit graph

14752 commits

Author SHA1 Message Date
Jason Volk
d0cbf6c14c ircd::simt: Add hardware ident access register (AMDDNA) 2023-01-02 20:25:58 +00:00
Jason Volk
a158c214a9 ircd::simt: Add hardware ident access register (AMDGCN). 2023-01-02 19:42:21 +00:00
Jason Volk
2b5f5f7a95 ircd::simt: Add preliminary memory characteristics tool. 2023-01-02 19:42:21 +00:00
Jason Volk
38c4959f6c ircd::simt: Add macro to differentiate RDNA over GCN. 2023-01-02 19:42:21 +00:00
Jason Volk
d377674748 ircd::simt: Split vector reduce_add to hadd. 2023-01-01 19:16:06 -08:00
Jason Volk
1f87668a28 ircd::simt: Add cycles_rtc() intrinsic for s_memrealtime on (AMDGCN). 2023-01-01 21:28:54 +00:00
Jason Volk
a53963fc51 ircd::math: Add difference template convenience. 2023-01-01 21:28:54 +00:00
Jason Volk
47e54547f9 ircd::simt: Add optimizations from cl_khr_subgroups support. 2023-01-01 21:28:54 +00:00
Jason Volk
0fb7577066 ircd::simt: Split is_pow() util to math header for now. 2023-01-01 21:28:54 +00:00
Jason Volk
521a04730c ircd:🆑 Conf items for device queueing and queue size defaults. 2023-01-01 21:28:54 +00:00
Jason Volk
ee52a9f24b modules/federation/invite: Fix invite request routing to proper handler version. 2022-11-19 15:21:04 -08:00
Jason Volk
517f109f3a ircd:🆑:exec: Override local size from reqd_work_group_size attribute. 2022-11-07 01:51:19 +00:00
Jason Volk
b068b7d959 ircd::simt: Add portables for static_assert macro. 2022-11-07 01:51:19 +00:00
Jason Volk
1148532894 ircd: Add shorthand to unpredictable branch attribute. 2022-11-07 01:51:19 +00:00
Jason Volk
1fd8e9954e configure: Add AMD GPU Performance API support. 2022-11-07 01:51:19 +00:00
Jason Volk
a5fb379aee configure: Configurations for AMD ROCm support. 2022-11-03 18:28:53 +00:00
Jason Volk
231a5305ca ircd:🆑 Additional device banner queries; improve formatting. 2022-11-03 18:28:53 +00:00
Jason Volk
110d4e7b17 ircd::gpt: Remove hostside backprop branch for now; simplify sample tokenizer. 2022-11-03 18:28:53 +00:00
Jason Volk
d5dc477de5 ircd: Minor fix makefile special recipe ordering. 2022-11-03 18:28:53 +00:00
Jason Volk
e8aea6f727 ircd:🆑 Optimize warp size query on AMD platforms supporting CL extension. 2022-11-03 18:28:53 +00:00
Jason Volk
35a994bafb ircd:🆑 Tweak queue properties for profiling exclusive of device queues. 2022-11-03 18:28:53 +00:00
Jason Volk
db549a64ca ircd::gpt: Add values for custom timestamp range to ctrl prof block. 2022-11-03 18:28:53 +00:00
Jason Volk
571271c6a6 ircd::gpt::pipe::code: Simplify source construction. 2022-11-03 18:28:53 +00:00
Jason Volk
601a7a3a6a ircd:🆑:code: Add construction from files and directories. 2022-11-03 18:28:53 +00:00
Jason Volk
5aeadc2d52 ircd:🆑:code: Split createProgram related out of ctors; static iov_max. 2022-11-03 18:28:53 +00:00
Jason Volk
d95718ed6f ircd::fs: Additional convenience tools for path suite. 2022-11-03 18:28:53 +00:00
Jason Volk
561be9973a ircd::simt::norm: Barrier for overlapping input and output buffers. (gfx1011) 2022-11-03 18:28:53 +00:00
Jason Volk
7c8163fcab ircd::simt: Alias asm to OpenCL __asm__. 2022-11-03 18:28:53 +00:00
Jason Volk
071e108eac ircd: Add amdgcn--amdhsa build recipes. 2022-11-03 18:28:53 +00:00
Jason Volk
ac163274f1 ircd::simt: Simplify keywording; tease out OpenCL version by target. 2022-10-30 18:44:50 +00:00
Jason Volk
e89703aa97 ircd::gpt::gpu: Limit dispatch to the number of control frame buffers. 2022-10-30 18:44:50 +00:00
Jason Volk
a50f9fbd1b ircd::simt: Improve assert related; fix 83114172 argument name. 2022-10-18 22:01:35 +00:00
Jason Volk
2609c21913 ircd::gpt::pipe: Enable mutable model; fixes for backpropagation; range stub. 2022-10-18 22:01:35 +00:00
Jason Volk
47117dde9a ircd::gpt::model: Fixes for amdgpu mmaps; reenable mapping by default. 2022-10-16 21:12:29 +00:00
Jason Volk
1ac69df8cd ircd::gpt::pipe::range: Use opts for construction. 2022-10-16 01:34:03 +00:00
Jason Volk
bc6b7486ba modules/web_hook: Update formatting for line-numbered pre tags. 2022-10-12 20:03:01 -07:00
Jason Volk
e8dad47247 ircd::gpt::model: Support mmap'ed and non-mmap'ed models based on conf. 2022-10-12 18:54:04 -07:00
Jason Volk
222d8404f1 ircd::server: Bail out of resolve handlers in case of unexpected callbacks. 2022-10-12 18:54:04 -07:00
Jason Volk
d750958f4c modules/client/sync/rooms/unread_notifications: Upgrade opts w/ designated init ctor. 2022-10-12 18:53:53 -07:00
Jason Volk
6a05fcefeb ircd::simt: Consolidate timestamp counter sampling into inline. 2022-10-12 18:53:53 -07:00
Jason Volk
54e3b8f5b4 ircd::simt: Consolidate portables and macros from units into headers. 2022-10-11 02:13:46 +00:00
Jason Volk
831141727b ircd::gpt::gpu: Add assert macro when trapping supported. 2022-10-09 03:29:29 +00:00
Jason Volk
c1168fcc30 ircd::gpt: Resolve behavior of opts.limit: 0=analysis, -n=unlimited, n=limited. 2022-10-09 03:13:09 +00:00
Jason Volk
1f57c224c6 ircd: Fix missing __has_builtin; minor cleanup. 2022-10-09 00:40:47 +00:00
Jason Volk
df522ef8e9 modules/client/search: Add cmd for limit override to workaround pagination bug in Riot. 2022-10-08 16:00:32 -07:00
Jason Volk
5294e607bd modules/client/search: Propagate exceptions from room handler. 2022-10-08 15:39:22 -07:00
Jason Volk
6fb9cd7bf1 ircd::tokens: Relax noexcept on token result closures. 2022-10-08 15:37:14 -07:00
Jason Volk
adc7f16831 modules/client/search: Allow empty search term as wildcard match for given filter keys. 2022-10-08 11:22:09 -07:00
Jason Volk
a9b22c9052 ircd::lex: Handle js null as falsy boolean. 2022-10-08 10:49:41 -07:00
Jason Volk
4c44125d37 ircd::gpt::samp: Split dispatch branch into memfun. 2022-10-06 22:28:23 +00:00