mirror of https://github.com/matrix-construct/construct synced 2024-06-10 14:08:56 +02:00
Commit graph

5889 commits

Author SHA1 Message Date
Jason Volk 6bbde81b44 Ensure EXTRA_*FLAGS is properly appended after other build flags. 2023-01-18 08:11:02 -08:00
Jason Volk 310053abcf ircd::logger: Appease the array-bounds bean counter. 2023-01-15 22:32:00 -08:00
Jason Volk feb7524fe0 ircd::db::database::events: Use explicitly defined non-sequence index args. 2023-01-15 22:32:00 -08:00
Jason Volk cbfbe51c35 ircd::net: Replace ifdef w/ constexpr for inclusion. 2023-01-15 22:32:00 -08:00
Jason Volk f516b86e4d configure: Eliminate clang runtime profiling infrastructure. 2023-01-14 16:44:47 -08:00
Jason Volk ef23d1e749 ircd::server: Improve exception format strings; add chunk count. 2023-01-13 19:37:49 -08:00
Jason Volk d377674748 ircd::simt: Split vector reduce_add to hadd. 2023-01-01 19:16:06 -08:00
Jason Volk 521a04730c ircd::cl: Conf items for device queueing and queue size defaults. 2023-01-01 21:28:54 +00:00
Jason Volk 517f109f3a ircd::cl::exec: Override local size from reqd_work_group_size attribute. 2022-11-07 01:51:19 +00:00
Jason Volk 231a5305ca ircd::cl: Additional device banner queries; improve formatting. 2022-11-03 18:28:53 +00:00
Jason Volk 110d4e7b17 ircd::gpt: Remove hostside backprop branch for now; simplify sample tokenizer. 2022-11-03 18:28:53 +00:00
Jason Volk d5dc477de5 ircd: Minor fix makefile special recipe ordering. 2022-11-03 18:28:53 +00:00
Jason Volk e8aea6f727 ircd::cl: Optimize warp size query on AMD platforms supporting CL extension. 2022-11-03 18:28:53 +00:00
Jason Volk 35a994bafb ircd::cl: Tweak queue properties for profiling exclusive of device queues. 2022-11-03 18:28:53 +00:00
Jason Volk db549a64ca ircd::gpt: Add values for custom timestamp range to ctrl prof block. 2022-11-03 18:28:53 +00:00
Jason Volk 571271c6a6 ircd::gpt::pipe::code: Simplify source construction. 2022-11-03 18:28:53 +00:00
Jason Volk 601a7a3a6a ircd::cl::code: Add construction from files and directories. 2022-11-03 18:28:53 +00:00
Jason Volk 5aeadc2d52 ircd::cl::code: Split createProgram related out of ctors; static iov_max. 2022-11-03 18:28:53 +00:00
Jason Volk d95718ed6f ircd::fs: Additional convenience tools for path suite. 2022-11-03 18:28:53 +00:00
Jason Volk 561be9973a ircd::simt::norm: Barrier for overlapping input and output buffers. (gfx1011) 2022-11-03 18:28:53 +00:00
Jason Volk 071e108eac ircd: Add amdgcn--amdhsa build recipes. 2022-11-03 18:28:53 +00:00
Jason Volk ac163274f1 ircd::simt: Simplify keywording; tease out OpenCL version by target. 2022-10-30 18:44:50 +00:00
Jason Volk e89703aa97 ircd::gpt::gpu: Limit dispatch to the number of control frame buffers. 2022-10-30 18:44:50 +00:00
Jason Volk 2609c21913 ircd::gpt::pipe: Enable mutable model; fixes for backpropagation; range stub. 2022-10-18 22:01:35 +00:00
Jason Volk 47117dde9a ircd::gpt::model: Fixes for amdgpu mmaps; reenable mapping by default. 2022-10-16 21:12:29 +00:00
Jason Volk 1ac69df8cd ircd::gpt::pipe::range: Use opts for construction. 2022-10-16 01:34:03 +00:00
Jason Volk e8dad47247 ircd::gpt::model: Support mmap'ed and non-mmap'ed models based on conf. 2022-10-12 18:54:04 -07:00
Jason Volk 222d8404f1 ircd::server: Bail out of resolve handlers in case of unexpected callbacks. 2022-10-12 18:54:04 -07:00
Jason Volk 6a05fcefeb ircd::simt: Consolidate timestamp counter sampling into inline. 2022-10-12 18:53:53 -07:00
Jason Volk 54e3b8f5b4 ircd::simt: Consolidate portables and macros from units into headers. 2022-10-11 02:13:46 +00:00
Jason Volk 831141727b ircd::gpt::gpu: Add assert macro when trapping supported. 2022-10-09 03:29:29 +00:00
Jason Volk c1168fcc30 ircd::gpt: Resolve behavior of opts.limit: 0=analysis, -n=unlimited, n=limited. 2022-10-09 03:13:09 +00:00
Jason Volk 6fb9cd7bf1 ircd::tokens: Relax noexcept on token result closures. 2022-10-08 15:37:14 -07:00
Jason Volk a9b22c9052 ircd::lex: Handle js null as falsy boolean. 2022-10-08 10:49:41 -07:00
Jason Volk 4c44125d37 ircd::gpt::samp: Split dispatch branch into memfun. 2022-10-06 22:28:23 +00:00
Jason Volk 9682f406b3 ircd::gpt::gpu: Mute printf() on unsupporting platforms. 2022-10-06 22:26:19 +00:00
Jason Volk 2afaa4fa4f ircd::gpt::pipe: Remove cruft. 2022-10-06 22:05:52 +00:00
Jason Volk 442dad869d ircd::gpt: Resolve cycle count sampling; add debug log; fix count. 2022-10-06 22:01:41 +00:00
Jason Volk 0917a1f041 ircd::gpt::pipe: Resolve control page sync at ends of sample. 2022-10-06 18:54:29 +00:00
Jason Volk be3fd4be62 ircd::gpt::pipe: Add 5th profile col for OpenCL 2.0; graceful default output. 2022-10-05 13:53:52 -07:00
Jason Volk a9709568b4 ircd::gpt::pipe: Move the frame mapping command to the cycle. 2022-10-05 20:10:31 +00:00
Jason Volk 33afa8a4fc ircd::gpt::gpu: Add global fence between attn and ffnn accumulations (gfx1011). 2022-10-05 20:10:31 +00:00
Jason Volk c4cceb425c ircd::gpt::gpu: Use explicit broadcast for local access. 2022-10-05 20:08:40 +00:00
Jason Volk 03e5f255c2 ircd::cl: Modernize queue properties; enable ooe. 2022-10-05 20:08:40 +00:00
Jason Volk 331a417656 ircd::gpt::gpu: Fix keywording for OpenCL 2.0+. 2022-10-02 01:30:10 +00:00
Jason Volk ae9f14bf86 ircd::gpt::pipe::code: Mute options for non-llvm pipe; attempt source build before bc. 2022-10-02 01:29:58 +00:00
Jason Volk ef7471defd ircd::cl: Add debug logs for mem creations. 2022-10-01 22:06:45 +00:00
Jason Volk 736c8014dd ircd::cl: Default device info queries on init; promote log level; fix dev ref. 2022-09-30 17:30:55 +00:00
Jason Volk 3d990f0890 Revert "ircd::cl: Isolate pthread enable hooking during init/finit." This reverts commit 1a29e591ef. 2022-09-30 17:30:44 +00:00
Jason Volk 35462afd04 ircd: Fix mods_ldso.o build flags as it includes boost/dll. 2022-09-30 17:30:39 +00:00