zulip

mirror of https://github.com/zulip/zulip.git synced 2025-11-22 07:21:23 +00:00

Author	SHA1	Message	Date
Steve Howell	b894597fa3	refactor: Use sets of stream_ids for helper args.	2020-10-15 15:12:01 -07:00
Steve Howell	3889554977	refactor: Extract send_peer_remove_events.	2020-10-15 15:12:01 -07:00
Steve Howell	f86823f82f	tests: Add cache_tries_captured helper.	2020-10-15 15:12:01 -07:00
Steve Howell	ce70d08cbf	test_helpers: Use mock.patch.multiple.	2020-10-15 15:12:01 -07:00
Tim Abbott	bf66e9c4ab	actions: Add transaction.atomic to bulk_add_subs_to_db_with_logging. This will ensure that we always fully execute the database part of modifying subscription objects. In particular, this should prevent invariant failures like #16347 where Subscription objects were created without corresponding RealmAuditLog entries. Fixes #16347.	2020-10-14 11:06:00 -07:00
Steve Howell	5728149e94	performance: Streamline query to add subscribers. We don't need the select_related('user_profile') optimization any more, because we just keep track of user info in our own data structures. In this codepath we are never actually modifying users; we just occasionally need their ids or emails. This can be a pretty substantive improvement if you are adding a bunch of users to a stream who each have a bunch of their own subscriptions. We could also limit the number of full rows in this query by adding an extra hop to the DB just to get colors (using values_list), and then only get full sub info for the streams that we're adding, rather than getting every single subscription, in full, for each user. Apart from finding what colors the user has already used, the only other reason we need all the columns in Subscription here is to handle streams that need to be reactivated. Otherwise we could do only("id", "active", "recipient_id", "user_profile_id") or similar. Fortunately, Subscription isn't an overly wide table; it's mostly bool fields. But by far the biggest thing to avoid is bringing in all the extra user_profile data. We have pretty good coverage on query counts here, so I think this fix is pretty low risk.	2020-10-14 11:03:07 -07:00
Steve Howell	116a441bc5	refactor: Introduce SubInfo class. This class removes a lot of the annoying tuples we were passing around. Also, by including the user everywhere, which is easily available to us when we make instances of SubInfo, it sets the stage to remove select_related('user_profile').	2020-10-14 10:53:10 -07:00
Steve Howell	febef45e38	minor: Add comments to do_get_streams.	2020-10-14 10:53:10 -07:00
Steve Howell	a9356508ca	events: Stop sending occupy/vacate events. We used to send occupy/vacate events when either the first person entered a stream or the last person exited. It appears that our two main apps have never looked at these events. Instead, it's generally the case that clients handle events related to stream creation/deactivation and subscribe/unsubscribe. Note that we removed the apply_events code related to these events. This doesn't affect the webapp, because the webapp doesn't care about the "streams" field in do_events_register. There is a theoretical situation where a third party client could be the victim of a race where the "streams" data includes a stream where the last subscriber has left. I suspect in most of those situations it will be harmless, or possibly even helpful to the extent that they'll learn about streams that are in a "quasi" state where they're activated but not occupied. We could try to patch apply_event to detect when subscriptions get added or removed. Or we could just make the "streams" piece of do_events_register not care about occupy/vacate semantics. I favor the latter, since it might actually be what users what, and it will also simplify the code and improve performance.	2020-10-14 10:53:10 -07:00
Steve Howell	1bcb8d8ee8	performance: Avoid computing page_params.streams in webapp. The query to get "occupied" streams has been expensive in the past. I'm not sure how much any recent attempts to optimize that query have mitigated the issue, but since we clearly aren't sending this data, there is no reason to compute it.	2020-10-14 10:53:10 -07:00
Steve Howell	79803f01f4	minor: Format some code in events.py.	2020-10-14 10:53:10 -07:00
Aman Agrawal	fbf7cb82a7	web_public_guest: Rename to web_public_visitor for clarity. Using web_public_guest for anonymous users is confusing since 'guest' is actually a logged-in user compared to web_public_guest which is not logged-in and has only read access to messages. So, we rename it to web_public_visitor.	2020-10-13 16:59:52 -07:00
Steve Howell	3b338ec32e	performance: Optimize filter_stream_authorization. We no longer do O(N) queries to get existing streams. This is a somewhat contrived use case--generally, we are not trying to re-subscribe a user to several streams. Still, we want to avoid this. This commit also makes `test_bulk_subscribe_many` do more work, and the change to the test helped me discover this bug.	2020-10-13 18:54:55 -04:00
Anders Kaseorg	6564540d15	docs: Fix some spelling errors. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-13 15:47:13 -07:00
Anders Kaseorg	dd48dbd912	docs: Add spaces to “check out”, “log in”, “set up”, “sign up” as verbs. “Checkout”, “login”, “setup”, and “signup” are nouns, not verbs. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-13 15:47:13 -07:00
Steve Howell	598601e8fc	stream events: Prevent spurious events. If a user asks to be subscribed to a stream that they are already subscribed to, then that stream won't be in new_stream_user_ids, and we won't need to send an event for it. This change makes that happen more automatically.	2020-10-13 11:28:17 -07:00
Steve Howell	18771099e4	performance: Introduce new_stream_user_ids. Let U = number of users to subscribe S = number of streams to subscribe We were technically doing N^3 amount of work when we sent certain events, or to be more precise, U * S * S amount of work. For each stream, we were looping through a list of tuples of size U * S to find the users for the stream. In practice either U or S is usually 1, so the performance gains here are probably negligible, especially since the constant factors here were just slinging around Python data. But the code is actually more readable now, so it's a double win.	2020-10-13 11:28:17 -07:00
Steve Howell	ebb605319b	refactor: Rename stream_map to recipient_id_to_stream. I want to make a new dict called stream_id_to_stream, and stream_map would be confusing.	2020-10-13 11:28:17 -07:00
Steve Howell	b502957184	refactor: Extract new_recipient_ids local. We rename needs_new_sub (which sounds like a boolean!) to new_recipient_ids, and we calculate it explicitly within the loop, so that we don't need to worry as much about subsequent passes through the loop mutating it. This allows us to also remove recipient_ids, which in turn lets us remove recipients_map, albeit with a small tweak for stream_map. I also introduce the my_subs local, which I use to more directly populate used_colors, as well as using it as the loop var.	2020-10-13 11:28:17 -07:00
Steve Howell	766892d8aa	import: Reuse get_last_message_id() helper.	2020-10-13 11:28:17 -07:00
Steve Howell	9df9934ed6	refactor: Pass realm to bulk_add_subscriptions. I think it's important that the callers understand that bulk_add_subscriptions assumes all streams are being created within a single realm, so I make it an explicit parameter. This may be overkill--I would also be happy if we just included the assertions from this commit.	2020-10-13 11:28:17 -07:00
Steve Howell	efc931a671	minor: Extract realm local.	2020-10-13 11:28:17 -07:00
Steve Howell	b2d0a2efb9	refactor: Extract send_subscription_add_events. This function now does all the work that we used to do with notify_subscriptions_added happening inside a loop. There's a small fine-tuning here, where we only get recent traffic on streams that we're actually sending events for.	2020-10-13 11:28:17 -07:00
Steve Howell	223ce83a0a	refactor: Clean up call to notify_subscriptions_added. We now just pass in all_subscribers_by_stream, rather than a callback. We also move sub_tuples_by_user closer to the loop where we call notify_subscriptions_added.	2020-10-13 11:28:17 -07:00
Steve Howell	811426b345	Extract send_stream_creation_events_for_private_streams. We can probably avoid passing in users here.	2020-10-12 16:40:37 -07:00
Steve Howell	1cfaef0d1a	refactor: Simplify pick_color logic. This removes the need to jankily mutate the active flag in the caller, and we don't need to mutate our subs_by_user either.	2020-10-12 16:40:37 -07:00
Steve Howell	13569ff97a	refactor: Eliminate new_subs. We now just process new subs for a user immediately within the loop.	2020-10-12 16:40:37 -07:00
Steve Howell	8c70fbde78	refactor: Use subs_to_add in return value. The subs_to_add is directly related to a var called new_subs, which I hope to eliminate soon.	2020-10-12 16:40:37 -07:00
Steve Howell	1afca3d430	minor: Extract local for stream.	2020-10-12 16:40:37 -07:00
Steve Howell	84aa1389d8	Extract bulk_add_subs_to_db_with_logging. This is a trivial code extraction.	2020-10-12 16:40:37 -07:00
Steve Howell	3ff9ce78ea	refactor: Extract send_peer_add_events.	2020-10-12 16:40:37 -07:00
Cody Piersall	5dab6e9d31	emoji-upload: Fix transparency issues on GIF emoji upload. This preserves the alpha layer on GIF images that need to be resized before being uploaded. Two important changes occur here: 1. The new frame is a copy of the original image, which preserves the GIF info. 2. The disposal method of the original GIF is preserved. This essentially determines what state each frame of the GIF starts from when it is drawn; see PIL's docs: https://pillow.readthedocs.io/en/stable/handbook/image-file-formats.html#saving for more info. This resolves some but not all of the test cases in #16370.	2020-10-11 16:23:07 -07:00
Anders Kaseorg	b7a94be152	python: Catch BaseException when we need to clean something up. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:16:16 -07:00
Anders Kaseorg	7f69c1d3d5	python: Catch specific exceptions from requests. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:11:41 -07:00
Anders Kaseorg	17ac17286c	python: Catch specific exceptions from subprocess. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:11:41 -07:00
Anders Kaseorg	aabef3d9be	python: Catch specific exceptions from orjson. Followup to #16120. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-11 16:11:41 -07:00
Alex Vandiver	c2132a4f9c	queue: Drop register_json_consumer / json_drain_queue interface. Now that all callsites use the same interface, drop the now-unused ones, and their tests.	2020-10-11 14:19:42 -07:00
Alex Vandiver	179c387409	tornado: Switch to start_json_consumer interface.	2020-10-11 14:19:42 -07:00
Alex Vandiver	f9358d5330	queue: Switch batch interface to use the channel.consume iterator. This low-level interface allows consuming from a queue with timeouts. This can be used to either consume in batches (with an upper timeout), or one-at-a-time. This is notably more performant than calling `.get()` repeatedly (what json_drain_queue does under the hood), which is "highly discouraged as it is very inefficient"[1]. Before this change: ``` $ ./manage.py queue_rate --count 10000 --batch Purging queue... Enqueue rate: 11158 / sec Dequeue rate: 3075 / sec ``` After: ``` $ ./manage.py queue_rate --count 10000 --batch Purging queue... Enqueue rate: 11511 / sec Dequeue rate: 19938 / sec ``` [1] https://www.rabbitmq.com/consumers.html#fetching	2020-10-11 14:19:40 -07:00
Alex Vandiver	2547bdbf4a	queue: Rename consume_wrapper to a better name.	2020-10-09 20:40:51 -07:00
Alex Vandiver	d5a6b0f99a	queue: Rename queue_size, and update for all local queues. Despite its name, the `queue_size` method does not return the number of items in the queue; it returns the number of items that the local consumer has delivered but unprocessed. These are often, but not always, the same. RabbitMQ's queues maintain the queue of unacknowledged messages; when a consumer connects, it sends to the consumer some number of messages to handle, known as the "prefetch." This is a performance optimization, to ensure the consumer code does not need to wait for a network round-trip before having new data to consume. The default prefetch is 0, which means that RabbitMQ immediately dumps all outstanding messages to the consumer, which slowly processes and acknowledges them. If a second consumer were to connect to the same queue, they would receive no messages to process, as the first consumer has already been allocated them. If the first consumer disconnects or crashes, all prior events sent to it are then made available for other consumers on the queue. The consumer does not know the total size of the queue -- merely how many messages it has been handed. No change is made to the prefetch here; however, future changes may wish to limit the prefetch, either for memory-saving, or to allow multiple consumers to work the same queue. Rename the method to make clear that it only contains information about the local queue in the consumer, not the full RabbitMQ queue. Also include the waiting message count, which is used by the `consume()` iterator for similar purpose to the pending events list.	2020-10-09 20:40:39 -07:00
Alex Vandiver	a1ce1aca3b	queue: Update comment to be more accurate about import errors.	2020-10-09 20:40:32 -07:00
Alex Vandiver	2d71ca1fb8	email: Remove unused `log_digest_event` function. Its last callsite was removed in `e46cbaffa2`. Also ref #6786.	2020-10-08 20:35:53 -07:00
sahil839	4e8f5b5b31	streams: Change access_stream_for_delete_or_update to also return sub. We modify access_stream_for_delete_or_update function to return Subscription object also along with stream. This change will be helpful in avoiding an extra query to get subscription object in code for updating subscription role.	2020-10-08 17:07:30 -07:00
Aman Agrawal	8b419c93e4	message_send: Fix old guests being treated as full members. For streams in which only full members are allowed to post, we block guest users from posting there. Guests users were blocked from posting to admin only streams already. So now, guest users can only post to STREAM_POST_POLICY_EVERYONE streams. This is not a new feature but a bugfix which should have happened when implementing full member stream policy / guest users.	2020-10-08 11:30:11 -07:00
Anders Kaseorg	363374346c	management: Use signal.pause to sleep forever. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2020-10-07 16:15:19 -07:00
akshatdalton	52c411df8a	emoji: Add padding around the gif on GIF emoji upload. Replaced ImageOps.fit by ImageOps.pad, in zerver/lib/upload.py, which returns a sized and padded version of the image, expanded to fill the requested aspect ratio and size. Fixes part of #16370.	2020-10-06 17:28:02 -07:00
Alex Vandiver	baf882a133	queue: Only ACK drain_queue once it has completed work on the list. Currently, drain_queue and json_drain_queue ack every message as it is pulled off of the queue, until the queue is empty. This means that if the consumer crashes between pulling a batch of messages off the queue, and actually processing them, those messages will be permanently lost. Sending an ACK on every message also results in a significant amount lot of traffic to rabbitmq, with notable performance implications. Send a singular ACK after the processing has completed, by making `drain_queue` into a contextmanager. Additionally, use the `multiple` flag to ACK all of the messages at once -- or explicitly NACK the messages if processing failed. Sending a NACK will re-queue them at the front of the queue. Performance of a no-op dequeue before this change: ``` $ ./manage.py queue_rate --count 50000 --batch Purging queue... Enqueue rate: 10847 / sec Dequeue rate: 2479 / sec ``` Performance of a no-op dequeue after this change (a 25% increase): ``` $ ./manage.py queue_rate --count 50000 --batch Purging queue... Enqueue rate: 10752 / sec Dequeue rate: 3079 / sec ```	2020-10-06 17:26:14 -07:00
Abhijeet Prasad Bodas	a20d22de43	i18n: Move locale select logic in home to i18n.py. Part of #16094. Moved the language selection preference logic from home.py to a new function in i18n.py to avoid repetition in analytics views and home views.	2020-10-02 14:56:20 -07:00
Tim Abbott	8c8f3ee13b	test_classes: Extract home view helpers for reuse.	2020-10-01 15:14:25 -07:00

... 33 34 35 36 37 ...

7343 Commits