dendrite

mirror of https://github.com/matrix-org/dendrite synced 2024-11-09 03:11:27 +01:00

Author	SHA1	Message	Date
Till	eb29a31550	Optimize `/sync` and history visibility (#2961 ) Should fix the following issues or make a lot less worse when using Postgres: The main issue behind #2911: The client gives up after a certain time, causing a cascade of context errors, because the response couldn't be built up fast enough. This mostly happens on accounts with many rooms, due to the inefficient way we're getting recent events and current state For #2777: The queries for getting the membership events for history visibility were being executed for each room (I think 185?), resulting in a whooping 2k queries for membership events. (Getting the statesnapshot -> block nids -> actual wanted membership event) Both should now be better by: - Using a LATERAL join to get all recent events for all joined rooms in one go (TODO: maybe do the same for room summary and current state etc) - If we're lazy loading on initial syncs, we're now not getting the whole current state, just to drop the majority of it because we're lazy loading members - we add a filter to exclude membership events on the first call to `CurrentState`. - Using an optimized query to get the membership events needed to calculate history visibility --------- Co-authored-by: kegsay <kegan@matrix.org>	2023-02-07 14:31:23 +01:00
devonh	4738fe656f	Roomserver published pkey migration (#2960 ) Adds a missed migration to update the primary key on the roomserver_published table in postgres. Primary key was changed in #2836.	2023-02-01 16:32:31 +00:00
Neil	738686ae68	Add `/_dendrite/admin/purgeRoom/{roomID}` (#2662 ) This adds a new admin endpoint `/_dendrite/admin/purgeRoom/{roomID}`. It completely erases all database entries for a given room ID. The roomserver will start by clearing all data for that room and then will generate an output event to notify downstream components (i.e. the sync API and federation API) to do the same. It does not currently clear media and it is currently not implemented for SQLite since it relies on SQL array operations right now. Co-authored-by: Neil Alexander <neilalexander@users.noreply.github.com> Co-authored-by: Till Faelligen <2353100+S7evinK@users.noreply.github.com>	2023-01-19 21:02:32 +01:00
Till	7d2344049d	Cleanup stale device lists for users we don't share a room with anymore (#2857 ) The stale device lists table might contain entries for users we don't share a room with anymore. This now asks the roomserver about left users and removes those entries from the table. Co-authored-by: Neil Alexander <neilalexander@users.noreply.github.com>	2022-12-12 08:20:59 +01:00
Till	2a77a910eb	Handle remote room upgrades (#2866 ) Makes the following tests pass ``` /upgrade moves remote aliases to the new room Local and remote users' homeservers remove a room from their public directory on upgrade ```	2022-11-14 12:07:13 +00:00
Till	1e79b0557e	Use a writer to assign state key NIDs (#2877 )	2022-11-14 12:06:27 +00:00
Till Faelligen	e177e0ae73	Fix oops, add simple UT	2022-11-11 16:44:59 +01:00
Till	c648c671a3	Fix issue with missing user NIDs (#2874 ) This should fix #2696 and possibly other related issues regarding missing user NIDs. (https://github.com/matrix-org/dendrite/issues/2094?)	2022-11-11 10:52:43 +01:00
Neil Alexander	6663728eb1	Fix SQLite `roomserver_published` migration	2022-11-01 16:08:13 +00:00
Till	2acc1d65fb	Optimize history visibility checks (#2848 ) This optimizes history visibility checks by (mostly) avoiding database hits. Possibly solves https://github.com/matrix-org/dendrite/issues/2777 Co-authored-by: Neil Alexander <neilalexander@users.noreply.github.com>	2022-11-01 15:07:17 +00:00
Till Faelligen	a785532463	Fix upgrade appservices	2022-10-27 16:01:51 +02:00
Till	444b4bbdb8	Add AS specific public room list endpoints (#2836 ) Adds `PUT /_matrix/client/v3/directory/list/appservice/{networkId}/{roomId}` and `DELTE /_matrix/client/v3/directory/list/appservice/{networkId}/{roomId}` support, as well as the ability to filter `/publicRooms` on networkID and including all networks.	2022-10-27 14:40:35 +02:00
Till	3c1474f68f	Fix `/get_missing_events` for rooms with `joined`/`invited` history_visibility (#2787 ) Sytest was using a wrong `history_visibility` for `invited` (https://github.com/matrix-org/sytest/pull/1303), so `invited` was passing for the wrong reason (-> defaulted to `shared`, as `invite` wasn't understood). This change now handles missing events like Synapse, if a server isn't allowed to see the event, it gets a redacted version of it, making the `get_missing_events` tests pass.	2022-10-11 16:04:02 +02:00
Till	1ca3f3efb5	Fix issue with DMs shown as normal rooms (#2776 ) Fixes #2121, test added in https://github.com/matrix-org/complement/pull/494	2022-10-07 16:00:12 +02:00
Neil Alexander	8e231130e9	Revert "tDatabase transaction tweaks in roomserver" This reverts commit `8d8f4689a0`.	2022-10-07 14:05:06 +01:00
Neil Alexander	8d8f4689a0	tDatabase transaction tweaks in roomserver	2022-10-07 12:21:55 +01:00
Neil Alexander	c85bc3434f	Optimise `QuerySharedUsers` so that we can only work on local users (#2766 ) Otherwise the sync API key change consumer wastes a lot of time trying to wake up the notifiers for non-local users.	2022-10-05 12:47:53 +01:00
Neil Alexander	f022fc1397	Remove `origin` field from PDUs (#2737 ) This nukes the `origin` field from PDUs as per matrix-org/matrix-spec#998, matrix-org/gomatrixserverlib#341.	2022-09-26 17:35:35 +01:00
Till	100fa9b235	Check unique constraint errors when manually inserting migrations (#2712 ) This should avoid unnecessary logging on startup if the migration (were we need `InsertMigration`) was already executed. This now checks for "unique constraint errors" for SQLite and Postgres and fails the startup process if the migration couldn't be manually inserted for some other reason.	2022-09-13 08:07:43 +02:00
Neil Alexander	c0e17bbe1b	Fix transactions around assigning NIDs	2022-09-09 13:30:09 +01:00
Till	8196b29657	Change detection of already executed migrations (#2665 ) This changes the detection of already executed migrations for the roomserver state block and keychange refactor. It now uses schema tables provided by the database engine to check if the column was already removed. We now also store the migration in the migrations table. This should stop e.g. Postgres from logging errors like `ERROR: column "event_nid" does not exist at character 8`.	2022-09-09 13:14:52 +01:00
Neil Alexander	522bd2999f	Allow un-rejecting events on reprocessing	2022-08-24 14:03:06 +01:00
Neil Alexander	14fea600bb	Detect `types.MissingStateError` in `CheckServerAllowedToSeeEvent` (#2667 ) This will hopefully stop some 500 errors on `/event` where there is no state-before known.	2022-08-23 13:57:11 +01:00
Neil Alexander	6b48ce0d75	State handling tweaks (#2652 ) This tweaks how rejected events are handled in room state and also to not apply checks we can't complete to outliers.	2022-08-18 17:06:13 +01:00
Neil Alexander	59bc0a6f4e	Reprocess rejected input events (#2647 ) * Reprocess outliers that were previously rejected * Might as well do all events this way * More useful errors * Fix queries * Tweak condition * Don't wrap errors * Report more useful error * Flatten error on `r.Queryer.QueryStateAfterEvents` * Some more debug logging * Flatten error in `QueryRestrictedJoinAllowed` * Revert "Flatten error in `QueryRestrictedJoinAllowed`" This reverts commit `1238b4184c`. * Tweak `QueryStateAfterEvents` * Handle MissingStateError too * Scope to room * Clean up * Fix the error * Only apply rejection check to outliers	2022-08-18 10:37:47 +01:00
Till	03ddd98f5e	Fix issues with migrations not getting executed (#2628 ) * Fix issues with migrations not getting executed * Check actual postgres error * Return error if it's not "column does not exist"	2022-08-08 10:18:57 +02:00
Till	1b7f84250a	Fix linter issues (#2624 ) * Try that again * All hail the mighty linter? * And once again * goimport all the things	2022-08-05 11:12:41 +02:00
Neil Alexander	2250768be1	Remove roominfo cache (#2615 ) * Remove roominfo cache It's the source of a number of race conditions which are seemingly causing bugs and CI failures. * Make the linter less sad	2022-08-03 17:14:21 +01:00
Neil Alexander	ca3fa58388	Various roominfo tweaks (#2607 )	2022-08-02 12:27:15 +01:00
Neil Alexander	119cde3766	De-race `types.RoomInfo` (#2600 )	2022-08-01 15:29:19 +01:00
Neil Alexander	05c83923e3	Optimise checking other servers allowed to see events (#2596 ) * Try optimising checking if server is allowed to see event * Fix error * Handle case where snapshot NID is 0 * Fix query * Update SQL * Clean up `CheckServerAllowedToSeeEvent` * Not supported on SQLite * Maybe placate the unit tests * Review comments	2022-08-01 14:11:00 +01:00
Till	081f5e7226	Update database migrations, remove goose (#2264 ) * Add new db migration * Update migrations Remove goose * Add possibility to test direct upgrades * Try to fix WASM test * Add checks for specific migrations * Remove AddMigration Use WithTransaction Add Dendrite version to table * Fix linter issues * Update tests * Update comments, outdent if * Namespace migrations * Add direct upgrade tests, skipping over one version * Split migrations * Update go version in CI * Fix copy&paste mistake * Use contexts in migrations Co-authored-by: kegsay <kegan@matrix.org> Co-authored-by: Neil Alexander <neilalexander@users.noreply.github.com>	2022-07-25 10:39:22 +01:00
Neil Alexander	c7d978274d	Try to fix HTTP 500s on `/members` (#2581 )	2022-07-22 19:43:48 +01:00
Neil Alexander	f0c8a03649	Membership updater refactoring (#2541 ) * Membership updater refactoring * Pass in membership state * Use membership check rather than referring to state directly * Delete irrelevant membership states * We don't need the leave event after all * Tweaks * Put a log entry in that I might stand a chance of finding * Be less panicky * Tweak invite handling * Don't freak if we can't find the event NID * Use event NID from `types.Event` * Clean up * Better invite handling * Placate the almighty linter * Blacklist a Sytest which is otherwise fine under Complement for reasons I don't understand * Fix the sytest after all (thanks @S7evinK for the spot)	2022-07-22 14:44:04 +01:00
Till	9507966ebd	Fix issue with membership event_nid being 0 (#2580 )	2022-07-20 12:39:06 +02:00
Neil Alexander	5c01306bb5	Add event state key cache (#2576 )	2022-07-19 12:15:48 +01:00
Neil Alexander	a1f9b02edf	Pointerise `types.RoomInfo` in the cache so we can update it in-place in the latest events updater	2022-07-13 10:13:34 +01:00
Neil Alexander	3ea21273bc	Ristretto cache (#2563 ) * Try Ristretto cache * Tweak * It's beautiful * Update GMSL * More strict keyable interface * Fix that some more * Make less panicky * Don't enforce mutability checks for now * Determine mutability using deep equality * Tweaks * Namespace keys * Make federation caches mutable * Update cost estimation, add metric * Update GMSL * Estimate cost for metrics better * Reduce counters a bit * Try caching events * Some guards * Try again * Try this * Use separate caches for hopefully better hash distribution * Fix bug with admitting events into cache * Try to fix bugs * Check nil * Try that again * Preserve order jeezo this is messy * thanks VS Code for doing exactly the wrong thing * Try this again * Be more specific * aaaaargh * One more time * That might be better * Stronger sorting * Cache expiries, async publishing of EDUs * Put it back * Use a shared cache again * Cost estimation fixes * Update ristretto * Reduce counters a bit * Clean up a bit * Update GMSL * 1GB * Configurable cache sizees * Tweaks * Add `config.DataUnit` for specifying friendly cache sizes * Various tweaks * Update GMSL * Add back some lazy loading caching * Include key in cost * Include key in cost * Tweak max age handling, config key name * Only register prometheus metrics if requested * Review comments @S7evinK * Don't return errors when creating caches (it is better just to crash since otherwise we'll `nil`-pointer exception everywhere) * Review comments * Update sample configs * Update GHA Workflow * Update Complement images to Go 1.18 * Remove the cache test from the federation API as we no longer guarantee immediate cache admission * Don't check the caches in the renewal test * Possibly fix the upgrade tests * Update to matrix-org/gomatrixserverlib#322 * Update documentation to refer to Go 1.18	2022-07-11 14:31:31 +01:00
Till	f3e8a9a4cb	Fix nil pointer access when redacting events (#2560 )	2022-07-07 11:40:53 +02:00
Neil Alexander	d4341a2d97	Return clearer error when no state NID exists for an event (#2555 )	2022-07-05 15:01:34 +01:00
Till	5087b36af0	Fix QuerySharedUsers for the SyncAPI keychange consumer (#2554 ) * Make more use of base.BaseDendrite * Fix QuerySharedUsers if no UserIDs are supplied	2022-07-05 14:50:56 +02:00
Till	660f7839f5	Correctly redact events over federation (#2526 ) * Ensure we check powerlevel/origin before redacting an event * Add passing test * Use pl.UserLevel * Make check more readable, also check for the sender	2022-06-09 18:38:07 +02:00
Neil Alexander	3d9fe20748	Fix bugs related to state resolution (#2507 ) * Fix bugs related to state resolution * Clean up `resolve-state` * Don't panic when entries can't be found * Ensure we have state entries for the auth events * Revert "Ensure we have state entries for the auth events" This reverts commit `9b13b7ed37`. * Revert "Revert "Ensure we have state entries for the auth events"" This reverts commit `d86db197e3`. * Fix bug * Try that again * Update gomatrixserverlib * Remove recursion from `loadAuthEvents`	2022-06-01 09:46:21 +01:00
Neil Alexander	6940c7c7dd	Try to spot state deletions when they happen (#2489 )	2022-05-25 16:40:31 +01:00
kegsay	6de29c1cd2	bugfix: E2EE device keys could sometimes not be sent to remote servers (#2466 ) * Fix flakey sytest 'Local device key changes get to remote servers' * Debug logs * Remove internal/test and use /test only Remove a lot of ancient code too. * Use FederationRoomserverAPI in more places * Use more interfaces in federationapi; begin adding regression test * Linting * Add regression test * Unbreak tests * ALL THE LOGS * Fix a race condition which could cause events to not be sent to servers If a new room event which rewrites state arrives, we remove all joined hosts then re-calculate them. This wasn't done in a transaction so for a brief period we would have no joined hosts. During this interim, key change events which arrive would not be sent to destination servers. This would sporadically fail on sytest. * Unbreak new tests * Linting	2022-05-17 13:23:35 +01:00
Till	05607d6b87	Add roomserver tests (3/4) (#2447 ) * Add Room Aliases tests * Add Rooms table test * Move StateKeyTuplerSorter to the types package * Add StateBlock tests Some optimizations * Add State Snapshot tests Some optimization * Return []int64 and convert to pq.Int64Array for postgres * Move []types.EventNID back to rows.Next() * Update tests, rename SelectRoomIDs	2022-05-16 19:33:16 +02:00
Till	6db08b2874	Add roomserver tests (2/?) (#2445 ) * Add invite table tests; move variable declarations * Add Membership table tests * Move variable declarations * Add PrevEvents table tests * Add Published table test * Add Redactions tests Fix bug in SQLite markRedactionValidatedSQL * PR comments, better readability for invite tests	2022-05-10 14:41:12 +02:00
Till	f69ebc6af2	Add roomserver tests (1/?) (#2434 ) * Add EventJSONTable tests * Add eventJSON tests * Add EventStateKeysTable tests * Add EventTypesTable tests * Add Events Table tests Move variable declaration outside loops Switch to testify/assert for tests * Move variable declaration outside loop * Remove random data * Fix issue where the EventReferenceSHA256 is not set * Add more tests * Revert "Fix issue where the EventReferenceSHA256 is not set" This reverts commit `8ae34c4e5f`. * Update GMSL * Add tests for duplicate entries * Test what happens if we select non-existing NIDs * Add test for non-existing eventType * Really update GMSL	2022-05-09 15:30:32 +02:00
Neil Alexander	4ad5f9c982	Global database connection pool (for monolith mode) (#2411 ) * Allow monolith components to share a single database pool * Don't yell about missing connection strings * Rename field * Setup tweaks * Fix panic * Improve configuration checks * Update config * Fix lint errors * Update comments	2022-05-03 16:35:06 +01:00
Neil Alexander	d983d17355	Fix lint errors	2022-03-24 10:03:22 +00:00

1 2 3 4

182 commits