synapse

mirror of https://mau.dev/maunium/synapse.git synced 2024-11-05 22:28:54 +01:00

Author	SHA1	Message	Date
Hillery Shay	f78b68a96b	Treat "\u0000" as "\u0020" for the purposes of message search (message indexing) (#10820 ) * add test to check if null code points are being inserted * add logic to detect and replace null code points before insertion into db * lints * add license to test * change approach to null substitution * add type hint for SearchEntry * Add changelog entry Signed-off-by: H.Shay <shaysquared@gmail.com> * updated changelog * update changelog message * remove duplicate changelog * Update synapse/storage/databases/main/events.py remove extra space Co-authored-by: Patrick Cloke <clokep@users.noreply.github.com> * rename and move test file, update tests, delete old test file * fix typo in comments * update _find_highlights_in_postgres to replace null byte with space * replace null byte in sqlite search insertion * beef up and reorganize test for this pr * update changelog * add type hints and update docstring * check db engine directly vs using env variable * refactor tests to be less repetitive * move replace logic into separate function * requested changes * Fix typo. * Update synapse/storage/databases/main/search.py Co-authored-by: reivilibre <olivier@librepush.net> * Update changelog.d/10820.misc Co-authored-by: Aaron Raimist <aaron@raim.ist> Co-authored-by: Patrick Cloke <clokep@users.noreply.github.com> Co-authored-by: reivilibre <olivier@librepush.net> Co-authored-by: Aaron Raimist <aaron@raim.ist>	2021-09-22 08:25:26 -07:00
reivilibre	8eb7cb2e0d	Make StateFilter frozen so we can hash it (#10816 ) Also enables Mypy for related tests.	2021-09-14 16:35:53 +01:00
Erik Johnston	74f01e11c9	Skip handling of push actions for outlier events (#10780 ) Outlier events don't ever have push actions associated with them, so we can skip some expensive queries during event persistence.	2021-09-08 15:18:35 +01:00
Eric Eastwood	dc75fb7f05	Populate `rooms.creator` field for easy lookup (#10697 ) Part of https://github.com/matrix-org/synapse/pull/10566 - Fill in creator whenever we insert into the rooms table - Add background update to backfill any missing creator values	2021-09-01 16:27:58 +01:00
reivilibre	642a42edde	Flatten the synapse.rest.client package (#10600 )	2021-08-17 11:57:58 +00:00
Erik Johnston	c37dad67ab	Improve event caching code (#10119 ) Ensure we only load an event from the DB once when the same event is requested multiple times at once.	2021-08-04 13:54:51 +01:00
reivilibre	fb086edaed	Fix codestyle CI from #10440 (#10511 ) Co-authored-by: Erik Johnston <erik@matrix.org>	2021-08-02 15:50:22 +00:00
Erik Johnston	01d45fe964	Prune inbound federation queues if they get too long (#10390 )	2021-08-02 13:37:25 +00:00
Toni Spets	ba5287f5e8	Allow setting transaction limit for db connections (#10440 ) Setting the value will help PostgreSQL free up memory by recycling the connections in the connection pool. Signed-off-by: Toni Spets <toni.spets@iki.fi>	2021-08-02 13:24:43 +00:00
Patrick Cloke	228decfce1	Update the MSC3083 support to verify if joins are from an authorized server. (#10254 )	2021-07-26 12:17:00 -04:00
Erik Johnston	54389d5697	Fix dropping locks on shut down (#10433 )	2021-07-20 14:24:25 +01:00
Jonathan de Jong	93729719b8	Use inline type hints in `tests/` (#10350 ) This PR is tantamount to running: python3.8 -m com2ann -v 6 tests/ (com2ann requires python 3.8 to run)	2021-07-13 11:52:58 +01:00
Jonathan de Jong	89cfc3dd98	[pyupgrade] `tests/` (#10347 )	2021-07-13 11:43:15 +01:00
Erik Johnston	85d237eba7	Add a distributed lock (#10269 ) This adds a simple best effort locking mechanism that works cross workers.	2021-06-29 19:15:47 +01:00
Eric Eastwood	96f6293de5	Add endpoints for backfilling history (MSC2716) (#9247 ) Work on https://github.com/matrix-org/matrix-doc/pull/2716	2021-06-22 10:02:53 +01:00
Marcus	8070b893db	update black to 21.6b0 (#10197 ) Reformat all files with the new version. Signed-off-by: Marcus Hoffmann <bubu@bubu1.eu>	2021-06-17 15:20:06 +01:00
Richard van der Hoff	b4b2fd2ece	add a cache to have_seen_event (#9953 ) Empirically, this helped my server considerably when handling gaps in Matrix HQ. The problem was that we would repeatedly call have_seen_events for the same set of (50K or so) auth_events, each of which would take many minutes to complete, even though it's only an index scan.	2021-06-01 12:04:47 +01:00
Erik Johnston	3e831f24ff	Don't hammer the database for destination retry timings every ~5mins (#10036 )	2021-05-21 17:57:08 +01:00
Richard van der Hoff	25f43faa70	Reorganise the database schema directories (#9932 ) The hope here is that by moving all the schema files into synapse/storage/schema, it gets a bit easier for newcomers to navigate. It certainly got easier for me to write a helpful README. There's more to do on that front, but I'll follow up with other PRs for that.	2021-05-07 10:22:05 +01:00
Andrew Morgan	fe604a022a	Remove various bits of compatibility code for Python <3.6 (#9879 ) I went through and removed a bunch of cruft that was lying around for compatibility with old Python versions. This PR also will now prevent Synapse from starting unless you're running Python 3.6+.	2021-04-27 13:13:07 +01:00
Jonathan de Jong	495b214f4f	Fix (final) Bugbear violations (#9838 )	2021-04-20 11:50:49 +01:00
Jonathan de Jong	4b965c862d	Remove redundant "coding: utf-8" lines (#9786 ) Part of #9744 Removes all redundant `# -*- coding: utf-8 -*-` lines from files, as python 3 automatically reads source code as utf-8 now. `Signed-off-by: Jonathan de Jong <jonathan@automatia.nl>`	2021-04-14 15:34:27 +01:00
Patrick Cloke	0b3112123d	Use mock from the stdlib. (#9772 )	2021-04-09 13:44:38 -04:00
Dirk Klimpel	48a1f4db31	Remove old admin API `GET /_synapse/admin/v1/users/<user_id>` (#9401 ) Related: #8334 Deprecated in: #9429 - Synapse 1.28.0 (2021-02-25) `GET /_synapse/admin/v1/users/<user_id>` has no - unit tests - documentation API in v2 is available (#5925 - 12/2019, v1.7.0). API is misleading. It expects `user_id` and returns a list of all users. Signed-off-by: Dirk Klimpel dirk@klimpel.org	2021-04-09 09:44:40 +01:00
Jonathan de Jong	2ca4e349e9	Bugbear: Add Mutable Parameter fixes (#9682 ) Part of #9366 Adds in fixes for B006 and B008, both relating to mutable parameter lint errors. Signed-off-by: Jonathan de Jong <jonathan@automatia.nl>	2021-04-08 22:38:54 +01:00
Richard van der Hoff	9e167d9c53	Merge remote-tracking branch 'origin/develop' into rav/drop_py35	2021-04-08 18:30:38 +01:00
Richard van der Hoff	24c58ebfc9	remove unused param on `make_tuple_comparison_clause`	2021-04-08 18:29:57 +01:00
Richard van der Hoff	3ada9b4264	Drop support for sqlite<3.22 as well	2021-04-08 16:42:32 +01:00
Patrick Cloke	e7b769aea1	Convert storage test cases to HomeserverTestCase. (#9736 )	2021-04-06 07:21:02 -04:00
Patrick Cloke	01dd90b0f0	Add type hints to DictionaryCache and TTLCache. (#9442 )	2021-03-29 12:15:33 -04:00
Patrick Cloke	2a99cc6524	Use the chain cover index in get_auth_chain_ids. (#9576 ) This uses a simplified version of get_chain_cover_difference to calculate auth chain of events.	2021-03-10 09:57:59 -05:00
Patrick Cloke	cb7fc7523e	Add a basic test for purging rooms. (#9541 ) Unfortunately this doesn't test re-joining the room since that requires having another homeserver to query over federation, which isn't easily doable in unit tests.	2021-03-08 09:21:36 -05:00
Dirk Klimpel	c8d9383cfb	Add the shadow-banning status to the display user admin API. (#9400 )	2021-02-17 15:19:23 -05:00
Eric Eastwood	0a00b7ff14	Update black, and run auto formatting over the codebase (#9381 ) - Update black version to the latest - Run black auto formatting over the codebase - Run autoformatting according to [`docs/code_style.md`](80d6dc9783/docs/code_style.md) - Update `code_style.md` docs around installing black to use the correct version	2021-02-16 22:32:34 +00:00
Erik Johnston	6633a4015a	Allow moving account data and receipts streams off master (#9104 )	2021-01-18 15:47:59 +00:00
Erik Johnston	350d9923cd	Make chain cover index bg update go faster (#9124 ) We do this by allowing a single iteration to process multiple rooms at a time, as there are often a lot of really tiny rooms, which can massively slow things down.	2021-01-15 17:18:37 +00:00
Erik Johnston	7036e24e98	Add background update for add chain cover index (#9029 )	2021-01-14 15:18:27 +00:00
Dirk Klimpel	7a2e9b549d	Remove user's avatar URL and displayname when deactivated. (#8932 ) This only applies if the user's data is to be erased.	2021-01-12 16:30:15 -05:00
Erik Johnston	1315a2e8be	Use a chain cover index to efficiently calculate auth chain difference (#8868 )	2021-01-11 16:09:22 +00:00
Patrick Cloke	23d701864f	Improve the performance of calculating ignored users in large rooms (#9024 ) This allows for efficiently finding which users ignore a particular user. Co-authored-by: Erik Johnston <erik@matrix.org>	2021-01-07 13:03:38 +00:00
Erik Johnston	70586aa63e	Try and drop stale extremities. (#8929 ) If we see stale extremities while persisting events, and notice that they don't change the result of state resolution, we drop them.	2020-12-18 09:49:18 +00:00
Brendan Abolivier	f2783fc201	Use the simple dictionary in full text search for the user directory (#8959 ) * Use the simple dictionary in fts for the user directory * Clarify naming	2020-12-17 14:42:30 +01:00
Dirk Klimpel	06006058d7	Make search statement in List Room and User Admin API case-insensitive (#8931 )	2020-12-17 10:43:37 +00:00
Dirk Klimpel	0a34cdfc66	Add number of local devices to Room Details Admin API (#8886 )	2020-12-11 10:42:47 +00:00
Erik Johnston	df4b1e9c74	Pass room_id to get_auth_chain_difference (#8879 ) This is so that we can choose which algorithm to use based on the room ID.	2020-12-04 15:52:49 +00:00
Richard van der Hoff	90cf1eec44	Remove redundant mocking	2020-12-02 17:53:38 +00:00
Patrick Cloke	30fba62108	Apply an IP range blacklist to push and key revocation requests. (#8821 ) Replaces the `federation_ip_range_blacklist` configuration setting with an `ip_range_blacklist` setting with wider scope. It now applies to: * Federation * Identity servers * Push notifications * Checking key validity for third-party invite events The old `federation_ip_range_blacklist` setting is still honored if present, but with reduced scope (it only applies to federation and identity servers).	2020-12-02 11:09:24 -05:00
Erik Johnston	c5b6abd53d	Correctly handle unpersisted events when calculating auth chain difference. (#8827 ) We do state res with unpersisted events when calculating the new current state of the room, so that should be the only thing impacted. I don't think this is tooooo big of a deal as: 1. the next time a state event happens in the room the current state should correct itself; 2. in the common case all the unpersisted events' auth events will be pulled in by other state, so will still return the correct result (or one which is sufficiently close to not affect the result); and 3. we mostly use the state at an event to do important operations, which isn't affected by this.	2020-12-02 15:22:37 +00:00
Dirk Klimpel	3f0ff53158	Remove deprecated `/_matrix/client/*/admin` endpoints (#8785 ) These are now only available via `/_synapse/admin/v1`.	2020-11-25 16:26:11 -05:00
Richard van der Hoff	deff8f628d	Merge pull request #8761 from matrix-org/rav/test_request_rendering Make `make_request` actually render the request	2020-11-17 15:17:04 +00:00
Erik Johnston	f737368a26	Add admin API for logging in as a user (#8617 )	2020-11-17 10:51:25 +00:00
Richard van der Hoff	be8fa65d0b	Remove redundant calls to `render()`	2020-11-16 18:24:08 +00:00
Richard van der Hoff	c3e3552ec4	fixup test	2020-11-16 15:51:47 +00:00
Richard van der Hoff	ebc405446e	Add a `custom_headers` param to `make_request` (#8760 ) Some tests want to set some custom HTTP request headers, so provide a way to do that before calling requestReceived().	2020-11-16 14:45:22 +00:00
Erik Johnston	f21e24ffc2	Add ability for access tokens to belong to one user but grant access to another user. (#8616 ) We do it this way round so that only the "owner" can delete the access token (i.e. `/logout/all` by the "owner" also deletes that token, but `/logout/all` by the "target user" doesn't). A future PR will add an API for creating such a token. When the target user and authenticated entity are different the `Processed request` log line will be logged with a: `{@admin:server as @bob:server} ...`. I'm not convinced by that format (especially since it adds spaces in there, making it harder to use `cut -d ' '` to chop off the start of log lines). Suggestions welcome.	2020-10-29 15:58:44 +00:00
Dan Callahan	aff1eb7c67	Tell Black to format code for Python 3.5 (#8664 ) This allows trailing commas in multi-line arg lists. Minor, but we might as well keep our formatting current with regard to our minimum supported Python version. Signed-off-by: Dan Callahan <danc@element.io>	2020-10-27 23:26:36 +00:00
Will Hunt	e8dbbcb64c	Fix get|set_type_stream_id_for_appservice store functions (#8648 )	2020-10-26 10:51:33 -04:00
Erik Johnston	2ac908f377	Don't instantiate Requester directly (#8614 )	2020-10-22 10:11:06 +01:00
Richard van der Hoff	7b71695388	Combine the two sets of tests for CacheDescriptor	2020-10-21 15:38:29 +01:00
Will Hunt	c276bd9969	Send some ephemeral events to appservices (#8437 ) Optionally sends typing, presence, and read receipt information to appservices.	2020-10-15 12:33:28 -04:00
Richard van der Hoff	0a08cd1065	Merge pull request #8548 from matrix-org/rav/deferred_cache Rename Cache to DeferredCache, and related changes	2020-10-15 11:42:07 +01:00
Richard van der Hoff	470dedd266	Combine the two sets of DeferredCache tests	2020-10-14 23:49:27 +01:00
Richard van der Hoff	4182bb812f	move DeferredCache into its own module	2020-10-14 23:38:14 +01:00
Richard van der Hoff	9f87da0a84	Rename Cache->DeferredCache	2020-10-14 23:38:14 +01:00
Richard van der Hoff	a34b17e492	Simplify `_locally_reject_invite` Update `EventCreationHandler.create_event` to accept an auth_events param, and use it in `_locally_reject_invite` instead of reinventing the wheel.	2020-10-13 23:58:48 +01:00
Erik Johnston	8de3703d21	Make event persisters periodically announce position over replication. (#8499 ) Currently background processes that stream the events stream use the "minimum persisted position" (i.e. `get_current_token()`) rather than the vector clock style tokens. This is broadly fine as it doesn't matter if the background processes lag a small amount. However, in extreme cases (i.e. SyTests) where we only write to one event persister the background processes will never make progress. This PR changes it so that the `MultiWriterIDGenerator` keeps the current position of a given instance as up to date as possible (i.e. using the latest token it sees if it's not in the process of persisting anything), and then periodically announces that over replication. This then allows the "minimum persisted position" to advance, albeit with a small lag.	2020-10-12 15:51:41 +01:00
Erik Johnston	ae5b2a72c0	Reduce serialization errors in MultiWriterIdGen (#8456 ) We call `_update_stream_positions_table_txn` a lot, which is an UPSERT that can conflict in `REPEATABLE READ` isolation level. Instead of doing a transaction consisting of a single query we may as well run it outside of a transaction.	2020-10-07 15:15:57 +01:00
Erik Johnston	e3debf9682	Add logging on startup/shutdown (#8448 ) This is so we can tell what is going on when things are taking a while to start up. The main change here is to ensure that transactions that are created during startup get correctly logged like normal transactions.	2020-10-02 15:20:45 +01:00
Erik Johnston	7941372ec8	Make token serializing/deserializing async (#8427 ) The idea is that in future tokens will encode a mapping of instance to position. However, we don't want to include the full instance name in the string representation, so instead we'll have a mapping between instance name and an immutable integer ID in the DB that we can use instead. We'll then do the lookup when we serialize/deserialize the token (we could alternatively pass around an `Instance` type that includes both the name and ID, but that turns out to be a lot more invasive).	2020-09-30 20:29:19 +01:00
Richard van der Hoff	6d2d42f8fb	Rewrite BucketCollector This was a bit unwieldy for what I wanted: in particular, I wanted to assign each measurement straight into a bucket, rather than storing an intermediate Counter which didn't do any bucketing at all. I've replaced it with something that is hopefully a bit easier to use. (I'm not entirely sure what the difference between a HistogramMetricFamily and a GaugeHistogramMetricFamily is, but given our counters can go down as well as up the latter sounds more accurate?)	2020-09-30 16:49:15 +01:00
Erik Johnston	ea70f1c362	Various clean ups to room stream tokens. (#8423 )	2020-09-29 21:48:33 +01:00
Erik Johnston	b1433bf231	Don't table scan events on worker startup (#8419 ) * Fix table scan of events on worker startup. This happened because we assumed "new" writers had an initial stream position of 0, so the replication code tried to fetch all events written by the instance between 0 and the current position. Instead, set the initial position of new writers to the current persisted up to position, on the assumption that new writers won't have written anything before that point. * Consider old writers coming back as "new". Otherwise we'd try and fetch entries between the old stale token and the current position, even though it won't have written any rows. Co-authored-by: Andrew Morgan <1342360+anoadragon453@users.noreply.github.com> Co-authored-by: Andrew Morgan <1342360+anoadragon453@users.noreply.github.com>	2020-09-29 16:42:19 +01:00
Will Hunt	8676d8ab2e	Filter out appservices from mau count (#8404 ) This is an attempt to fix #8403.	2020-09-29 13:11:02 +01:00
Erik Johnston	bd380d942f	Add checks for postgres sequence consistency (#8402 )	2020-09-28 18:00:30 +01:00
Erik Johnston	f112cfe5bb	Fix MultiWriteIdGenerator's handling of restarts. (#8374 ) On startup `MultiWriteIdGenerator` fetches the maximum stream ID for each instance from the table and uses that as its initial "current position" for each writer. This is problematic as a) it involves either a scan of events table or an index (neither of which is ideal), and b) if rows are being persisted out of order elsewhere while the process restarts then using the maximum stream ID is not correct. This could theoretically lead to race conditions where e.g. events that are persisted out of order are not sent down sync streams. We fix this by creating a new table that tracks the current positions of each writer to the stream, and update it each time we finish persisting a new entry. This is a relatively small overhead when persisting events. However for the cache invalidation stream this is a much bigger relative overhead, so instead we note that for invalidation we don't actually care about reliability over restarts (as there's no caches to invalidate) and simply don't bother reading and writing to the new table in that particular case.	2020-09-24 16:53:51 +01:00
Erik Johnston	cbabb312e0	Use `async with` for ID gens (#8383 ) This will allow us to hit the DB after we've finished using the generated stream ID.	2020-09-23 16:11:18 +01:00
Patrick Cloke	8a4a4186de	Simplify super() calls to Python 3 syntax. (#8344 ) This converts calls like super(Foo, self) -> super(). Generated with: sed -i "" -Ee 's/super\([^\(]+\)/super()/g' */.py	2020-09-18 09:56:44 -04:00
Erik Johnston	deedb91732	Fix `MultiWriterIdGenerator.current_position`. (#8257 ) It did not correctly handle IDs finishing being persisted out of order, resulting in the `current_position` lagging until new IDs are persisted.	2020-09-08 14:26:54 +01:00
Patrick Cloke	cef00211c8	Allow for make_awaitable's return value to be re-used. (#8261 )	2020-09-08 07:26:55 -04:00
Patrick Cloke	c619253db8	Stop sub-classing object (#8249 )	2020-09-04 06:54:56 -04:00
Brendan Abolivier	5a1dd297c3	Re-implement unread counts (again) (#8059 )	2020-09-02 17:19:37 +01:00
Erik Johnston	bbb3c8641c	Make MultiWriterIDGenerator work for streams that use negative stream IDs (#8203 ) This is so that we can use it for the backfill events stream.	2020-09-01 13:36:25 +01:00
Richard van der Hoff	45e8f7726f	Rename `get_e2e_device_keys` to better reflect its purpose (#8205 ) ... and to show that it does something slightly different to `_get_e2e_device_keys_txn`. `include_all_devices` and `include_deleted_devices` were never used (and `include_deleted_devices` was broken, since that would cause `None`s in the result which were not handled in the loop below). Add some typing too.	2020-08-29 00:14:17 +01:00
Patrick Cloke	e00816ad98	Do not yield on awaitables in tests. (#8193 )	2020-08-27 17:24:46 -04:00
Patrick Cloke	b49a5b9307	Convert stats and related calls to async/await (#8192 )	2020-08-27 17:24:37 -04:00
Patrick Cloke	b71d4a094c	Convert simple_delete to async/await. (#8191 )	2020-08-27 14:16:41 -04:00
Erik Johnston	5649b7f3d0	Fix missing _add_persisted_position (#8179 ) This was forgotten in #8164.	2020-08-27 13:20:34 +01:00
Patrick Cloke	30426c7063	Convert additional database methods to async (select list, search, insert_many, delete_*) (#8168 )	2020-08-27 07:41:01 -04:00
Patrick Cloke	4a739c73b4	Convert simple_update* and simple_select* to async (#8173 )	2020-08-27 07:08:38 -04:00
Andrew Morgan	a466b67972	Reduce run-times of tests by advancing the reactor less (#7757 )	2020-08-27 11:39:53 +01:00
Patrick Cloke	4c6c56dc58	Convert simple_select_one and simple_select_one_onecol to async (#8162 )	2020-08-26 07:19:32 -04:00
Erik Johnston	eba98fb024	Add functions to `MultiWriterIdGen` used by events stream (#8164 )	2020-08-25 17:32:30 +01:00
Brendan Abolivier	3f49f74610	Don't fail /submit_token requests on incorrect session ID if request_token_inhibit_3pid_errors is turned on (#7991 ) * Don't raise session_id errors on submit_token if request_token_inhibit_3pid_errors is set * Changelog * Also wait some time before responding to /requestToken * Incorporate review * Update synapse/storage/databases/main/registration.py Co-authored-by: Andrew Morgan <1342360+anoadragon453@users.noreply.github.com> * Incorporate review Co-authored-by: Andrew Morgan <1342360+anoadragon453@users.noreply.github.com>	2020-08-24 11:33:55 +01:00
Patrick Cloke	f594e434c3	Switch the JSON byte producer from a pull to a push producer. (#8116 )	2020-08-19 08:07:57 -04:00
Erik Johnston	76d21d14a0	Separate `get_current_token` into two. (#8113 ) The function is used for two purposes: 1) for subscribers of streams to get a token they can use to get further updates with, and 2) for replication to track position of the writers of the stream. For streams with a single writer the two scenarios produce the same result, however the situation becomes complicated for streams with multiple writers. The current `MultiWriterIdGenerator` does not correctly handle the first case (which is not an issue as it's only used for the `caches` stream which nothing subscribes to outside of replication).	2020-08-19 10:39:31 +01:00
Patrick Cloke	f40645e60b	Convert events worker database to async/await. (#8071 )	2020-08-18 16:20:49 -04:00
Patrick Cloke	050e20e7ca	Convert some of the general database methods to async (#8100 )	2020-08-17 12:18:01 -04:00
Patrick Cloke	ad6190c925	Convert stream database to async/await. (#8074 )	2020-08-17 07:24:46 -04:00
Patrick Cloke	ac77cdb64e	Add a shadow-banned flag to users. (#8092 )	2020-08-14 12:37:59 -04:00
Patrick Cloke	5ecc8b5825	Convert devices database to async/await. (#8069 )	2020-08-12 10:51:42 -04:00

509 commits