zulip

mirror of https://github.com/zulip/zulip.git synced 2025-10-23 04:52:12 +00:00

Author	SHA1	Message	Date
Anders Kaseorg	2ddafe728e	thumbnail: Avoid BytesIO.getbuffer. It seems to cause ‘BufferError: Existing exports of data: object cannot be re-sized’ on Python 3.13. Signed-off-by: Anders Kaseorg <andersk@mit.edu>	2025-08-13 14:11:05 -07:00
Alex Vandiver	7480510aeb	embeds: Propagate group membership before updating UserMessage flags.	2025-08-13 10:38:40 -07:00
Prakhar Pratyush	5f8edf669d	zerver: Add endpoint to register a push device to server. This commit adds an endpoint to register a push device to receive E2EE push notifications.	2025-07-14 14:52:39 -07:00
Anders Kaseorg	392ac742b4	deferred_email_senders: Correct worker class name. This name seems to have been copied from zerver.worker.email_senders.ImmediateEmailSenderWorker, but deferred is the opposite of immediate. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2025-06-25 12:03:23 -07:00
Tim Abbott	0ec07fe4c8	queue: Allow sharding user_activity worker. This follows the existing patterns for the sharded mobile notifications worker.	2025-06-06 10:33:20 -07:00
Alex Vandiver	198a5a8294	email_senders: Handle a None realm_id. Some codepaths send an event with `realm_id=None` -- which we still must trim out of the event, even if we do not add a `realm` value for them.	2025-05-22 10:30:39 -07:00
Alex Vandiver	1f0cfd4662	email-mirror: Add a standalone server that processes incoming email. Using postfix to handle the incoming email gateway complicates things a great deal: - It cannot verify that incoming email addresses exist in Zulip before accepting them; it thus accepts mail at the `RCPT TO` stage which it cannot handle, and thus must reject after the `DATA`. - It is built to handle both incoming and outgoing email, which results in subtle errors (`1c17583ad5`, `79931051bd`, `a53092687e`, #18600). - Rate-limiting happens much too late to avoid denial of service (#12501). - Mis-configurations of the HTTP endpoint can break incoming mail (#18105). Provide a replacement SMTP server which accepts incoming email on port 25, verifies that Zulip can accept the address, and that no rate-limits are being broken, and then adds it directly to the relevant queue. Removes an incorrect comment which implied that missed-message addresses were only usable once. We leave rate-limiting to only channel email addresses, since missed-message addresses are unlikely to be placed into automated systems, as channel email addresses are. Also simplifies #7814 somewhat.	2025-05-19 16:39:44 -07:00
Alex Vandiver	1e1292f2f5	email_senders: realm_id can exist but be None.	2025-05-16 11:30:48 -07:00
Aman Agrawal	136c0f1c44	registration: Enable import from slack using realm registration form. Co-authored-by: Alex Vandiver <alexmv@zulip.com> Co-authored-by: Tim Abbott <tabbott@zulip.com>	2025-05-14 13:24:38 -07:00
Alex Vandiver	6b3143d7fc	send_email: Add a flag to force all emails through the queue. Sending emails synchronously is useful because it reports configuration errors -- but it also means that occasional failures can result in ugly 500's, since those don't retry. Add a setting which forces all email to go through the `emil_senders` queue, so it can be retried as needed.	2025-04-22 10:26:25 -07:00
Anders Kaseorg	e8faa4a029	worker: Check if Sentry is initialized before calling add_breadcrumb. Otherwise we get spammed with “Dropped breadcrumb because no client bound” log messages. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2025-04-08 10:17:49 -07:00
Alex Vandiver	c5200e8b05	deliver_scheduled_emails: Use a queue, instead of infinite retries. `deliver_scheduled_emails` tries to deliver the email synchronously, and if it fails, it retries after 10 seconds. Since it does not track retries, and always tries the earliest-scheduled-but-due message first, the worker will not make forward progress if there is a persistent failure with that message, and will retry indefinitely. This can result in excessive network or email delivery charges from the remote SMTP server. Switch to delivering emails via a new queue worker. The `deliver_scheduled_emails` job now serves only to pull deferred jobs out of the table once they are due, insert them into RabbitMQ, and then delete them. This limits the potential for head-of-queue failures to failures inserting into RabbitMQ, which is more reasonable than failures speaking to a complex external system we do not control. Retries and any connections to the SMTP server are left to the RabbitMQ consumer. We build a new RabbitMQ queue, rather than use the existing `email_senders` queue, because that queue is expected to be reasonably low-latency, for things like missed message notifications. The `send_future_email` codepath which inserts into ScheduledEmails is also (ab)used to digest emails, which are extremely bursty in their frequency -- and a large burst could significantly delay emails behind it in the queue. The new queue is explicitly only for messages which were not initiated by user actions (e.g., invitation reminders, digests, new account follow-ups) which are thus not latency-sensitive. Fixes: #32463.	2025-03-04 16:09:25 -08:00
Alex Vandiver	98362de185	models: Add content_type to ImageAttachment. This means that only ImageAttachment row needs to be fetched, and removes the need to pass around an extra parameter. This denormalization is safe, since in general Attachment rows are read-only, so we are not concerned with drift between the Attachment and ImageAttachment tables. We cannot make content_type non-null, since while the both the `content_type` column in Attachment and populating that from requests predates the ImageAttachment table, we have both backfilled ImageAttachment rows to consider, and imports may also leave files with no `content_type`. Any backfill of currently-null `content_type` values will thus need to update both tables. This change fixes a race condition when importing. ImageAttachment rows are imported before rendering Messages, which are both before importing Attachment rows; if the thumbnailing finished after the Message was imported but before Attachment rows were imported, then the re-rendering step would not know the image's content-type.	2025-01-31 14:29:57 -08:00
PieterCK	a995510f0c	worker: Flag messages processed by embedded bot. This commit updates embedded bots to mark messages they have process as read. Since the service bots have their own `UserMessage` rows, this change enables us to track whether the bot has in fact processed the message by adding the `read` flag to their `UserMessage`. Fixes #28869.	2025-01-24 17:56:44 -08:00
PieterCK	cacd6bb88c	worker: Flag messages processed by outgoing bot. This commit updates outgoing bots to mark messages they process as read. Since the service bots have their own `UserMessage` rows, this change enables us to track whether the bot has in fact processed the message by adding the `read` flag to their `UserMessage`.	2025-01-24 17:56:44 -08:00
Alex Vandiver	8bd8a33dd2	thumbnail: Show the first few frames of large animated images. `71406ac767` switched the IMAGE_BOMB_TOTAL_PIXELS cutoff for what images we preview to include the number of frames in the calculation. While accurate to the implementation (thumbnailing a 1k-frame animation is prohibitive, even a small resolutions), this was a behaviour change from without thumbnailing -- animated gifs did not display inline at all anymore. Switch to thumbnailing as many frames as we can fit into a pixel-based animated thumbnailing threshold, with a minimum of three (to be able to convey that the image is actually animated). Smaller-resolution images will hence get more frames in their preview. This also allows the standard animate-on-hover or always-animate behaviour to be true to their configurations, without confusing edge cases. Fixes: #32609.	2025-01-15 09:56:19 -08:00
Alex Vandiver	230bae17bb	thumbnail: Generate a transcoded high-res version of HEIC/TIFF images. If the content-type of the image is not in INLINE_MIME_TYPES, then we do not expect browsers to be able to display it. This behaviour is particularly confusing because the thumbnail will render properly, since that will be in the more widely-supported WebP format, but the lightbox will show a broken image. In these cases, generate a high-resolution (4032x3024) "thumbnail" which clients can choose to use instead. This thumbnail format is not in the listed in the server's advertised thumbnail size list, because it is not reliably generated for every image. The transcoded thumbnail format is set on the `img` tag if it is generated, and the original content-type is always passed to the client, so it can decide how or if to render the original image. This content-type is as the _original uploader_ specified it, so may be incorrect. The transcoded image is not animated, even if the original was. HEIC files can nominally be animated, but in testing libvips was not able to correctly recognize them as such. TIFF files are parsed as being "animated," with one page per frame; this is of dubious utility, so we merely transcode the first page. Always generating a static transcoded image serves to also limit the computational time spent. THUMBNAIL_OUTPUT_FORMATS is switched to be a tuple to ensure that it is not accidentally mutated.	2025-01-09 09:10:28 -08:00
Anders Kaseorg	63aaafb94a	send_email: Parse emails in a way mypy 1.14 understands. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2024-12-21 21:06:53 -08:00
Anders Kaseorg	19b8cde27f	ruff: Fix PLC0206 Extracting value from dictionary without calling `.items()`. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2024-12-21 21:06:53 -08:00
opmkumar	5b0c55fda3	realm: Add option to schedule data deletion while deactivating. Introduce a feature to schedule realm data deletion time during realm deactivation. This includes a server-level setting to configure the minimum and maximum allowed deletion days. Co-authored-by: Ujjawal Modi <umodi2003@gmail.com> Co-authored-by: Lauryn Menard <lauryn@zulip.com> Fixes #24677.	2024-12-18 23:06:12 -08:00
Prakhar Pratyush	3bad36ef8c	queue: Rename queue_json_publish to queue_json_publish_rollback_unsafe. This commit renames the 'queue_json_publish' function to 'queue_json_publish_rollback_unsafe' to reflect the fact that it doesn't wait for the db transaction (within which it gets called, if any) to commit and sends event irrespective of commit or rollback. In most of the cases we don't want to send event in the case of rollbacks, so the caller should be aware that calling the function directly is rollback unsafe. Fixes part of #30489.	2024-12-06 09:23:02 -08:00
Alex Vandiver	19d115a9da	email_mirror: Set a short timeout on parsing incoming emails. This timeout needs to be short enough that we don't drop the RabbitMQ connection. Also drop the offending message (by returning with no further exception) so we don't hit a head-of-queue failure situation. Ideally, the parser would just be lightning-fast, so this would never happen.	2024-11-22 14:31:30 -08:00
Tim Abbott	c134cc3136	queue_processors: Disable missedmesssage worker on staging. This worker isn't designed to have multiple copies running.	2024-11-22 14:31:30 -08:00
Prakhar Pratyush	b369177341	embed_links: Add `savepoint=False` to avoid creating savepoints. It helps to avoid creating unintended savepoints in the future. This is as a part of our plan to explicitly mark all the transaction.atomic calls with either 'savepoint=False' or 'durable=True' as required.	2024-11-21 14:55:15 -08:00
Prakhar Pratyush	9c9866461a	transaction: Add `durable=True` to the outermost db transactions. This commit adds `durable=True` to the outermost db transactions created in the following: * confirm_email_change * handle_upload_pre_finish_hook * deliver_scheduled_emails * restore_data_from_archive * do_change_realm_subdomain * do_create_realm * do_deactivate_realm * do_reactivate_realm * do_delete_user * do_delete_user_preserving_messages * create_stripe_customer * process_initial_upgrade * do_update_plan * request_sponsorship * upload_message_attachment * register_remote_server * do_soft_deactivate_users * maybe_send_batched_emails It helps to avoid creating unintended savepoints in the future. This is as a part of our plan to explicitly mark all the transaction.atomic calls with either 'savepoint=False' or 'durable=True' as required. * 'savepoint=True' is used in special cases.	2024-11-05 17:58:47 -08:00
Anders Kaseorg	ac2b1cd45d	worker: Address sentry_sdk deprecations. https://docs.sentry.io/platforms/python/migration/1.x-to-2.x#scope-configuring https://github.com/getsentry/sentry-python/releases/2.0.0 https://github.com/getsentry/sentry-python/releases/2.15.0 Signed-off-by: Anders Kaseorg <anders@zulip.com>	2024-10-22 10:05:01 -07:00
Prakhar Pratyush	07dcee36b2	export_realm: Add RealmExport model. Earlier, we used to store the key data related to realm exports in RealmAuditLog. This commit adds a separate table to store those data. It includes the code to migrate the concerned existing data in RealmAuditLog to RealmExport. Fixes part of #31201.	2024-10-04 12:06:35 -07:00
Anders Kaseorg	1b4e02c5d0	thumbnail: Remove type: ignore. (An alternate solution is message_classes: list[type[Message \| ArchivedMessage]].) Signed-off-by: Anders Kaseorg <anders@zulip.com>	2024-10-04 13:54:14 -04:00
Alex Vandiver	912c1b5984	thumbnail: Tighten and clarify the "type: ignore" limitation.	2024-10-04 09:10:14 -07:00
Alex Vandiver	3cbbf2307b	thumbnail: Only lock the message row, not the Attachment row. This prevents a deadlock between the thumbnailing worker and message sending, as follows: 1. A user uploads an image, making Attachment and ImageAttachment rows, as well as enqueuing a job in the thumbnailing queue. 2. Message sending starts a transaction, creates the Message row, and calls `do_claim_attachments`, which edits the Attachment row of the upload (implicitly locking it). 3. The thumbnailing worker starts a transaction, locks the ImageAttachment row for its image, thumbnails it, and then attempts to `select_for_update()` the message objects (joined to the Attachments table) to find the ones which link to the attachment in question. This query blocks, since "a locking clause without a table list affects all tables used in the statement"[^1] and the message-send request already has a write lock on the Attachments row in question. 4. The message-send request attempts to re-fetch the ImageAttachment row inside the transaction, which tries to pull a lock on it. 5. Deadlock, because the message-send request has the Attachment lock, and waits for the ImageAttachment lock; the thumbnailing worker has the ImageAttachment lock, and waits for the Attachment lock. We break this deadlock by limiting the `update_message_rendered_content` `select_for_update` to only take the lock on the Message table, and not also the Attachments table -- no changes will be made to the Attachments, so no lock is necessary there. This allows the thumbnailing worker to successfully pull the empty list of messages (since the message-send request has not commits its transaction, and thus the Message row is not visible yet), and release its ImageAttachment lock so that the message-send request can proceed. [^1]: https://www.postgresql.org/docs/current/sql-select.html#SQL-FOR-UPDATE-SHARE	2024-10-04 09:10:14 -07:00
Prakhar Pratyush	65f465562f	export_realm: Remove the 'react on consent message' approach. For exporting full with consent: * Earlier, a message advertising users to react with thumbs up was sent and later used to determine the users who consented. * Now, we no longer need to send such a message. This commit updates the logic to use `allow_private_data_export` user-setting to determine users who consented. Fixes part of #31201.	2024-09-24 14:32:42 -07:00
Alex Vandiver	ce0df00e44	export: Notify all realm admins on realm export.	2024-09-23 10:02:43 -07:00
Alex Vandiver	b4764f49df	upload: Download files with their original names. Fixes: #29491.	2024-09-09 12:40:17 -07:00
Alex Vandiver	6f20c15ae9	thumbnail: Resolve a race condition when rendering messages. Messages are rendered outside of a transaction, for performance reasons, and then sent inside of one. This opens thumbnailing up to a race where the thumbnails have not yet been written when the message is rendered, but the message has not been sent when thumbnailing completes, causing `rewrite_thumbnailed_images` to be a no-op and the message being left with a spinner which never resolves. Explicitly lock and use he ImageAttachment data inside the message-sending transaction, to rewrite the message content with the latest information about the existing thumbnails. Despite the thumbnailing worker taking a lock on Message rows to update them, this does not lead to deadlocks -- the INSERT of the Message rows happens in a transaction, ensuring that either the message rending blocks the thumbnailing until the Message row is created, or that the `rewrite_thumbnailed_images` and Message INSERT waits until thumbnailing is complete (and updated no Message rows).	2024-08-01 16:48:16 -07:00
Mateusz Mandera	aaca394813	presence: Remove the queue worker.	2024-07-31 16:46:42 -07:00
Alex Vandiver	2ea0cc0005	thumbnail: Add a data-original-dimensions attribute. This allows clients to potentially lay out the thumbnails more intelligently, or to provide a better "progressive-load" experience when enlarging the thumbnail.	2024-07-22 22:41:10 -04:00
Alex Vandiver	65828b20e9	thumbnail: Factor out a dataclass for markdown image metadata.	2024-07-22 22:41:10 -04:00
Alex Vandiver	b42863be4b	markdown: Show thumbnails for uploaded images. Fixes: #16210.	2024-07-21 18:41:59 -07:00
Alex Vandiver	4351cc5914	thumbnail: Move get_image_thumbnail_path and split_thumbnail_path.	2024-07-18 13:50:28 -07:00
Alex Vandiver	d121a80b78	upload: Serve thumbnailed images.	2024-07-16 13:22:15 -07:00
Alex Vandiver	2e38f426f4	upload: Generate thumbnails when images are uploaded. A new table is created to track which path_id attachments are images, and for those their metadata, and which thumbnails have been created. Using path_id as the effective primary key lets us ignore if the attachment is archived or not, saving some foreign key messes. A new worker is added to observe events when rows are added to this table, and to generate and store thumbnails for those images in differing sizes and formats.	2024-07-16 13:22:15 -07:00
Anders Kaseorg	1e9b6445a9	ruff: Fix PLR6104 Use `+=` to perform an augmented assignment directly. This is a preview rule, not yet enabled by default. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2024-07-14 13:49:51 -07:00
Anders Kaseorg	b96feb34f6	ruff: Fix SIM117 Use a single `with` statement with multiple contexts. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2024-07-14 13:48:32 -07:00
Anders Kaseorg	0fa5e7f629	ruff: Fix UP035 Import from `collections.abc`, `typing` instead. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2024-07-13 22:28:22 -07:00
Anders Kaseorg	531b34cb4c	ruff: Fix UP007 Use `X \| Y` for type annotations. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2024-07-13 22:28:22 -07:00
Anders Kaseorg	e08a24e47f	ruff: Fix UP006 Use `list` instead of `List` for type annotation. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2024-07-13 22:28:22 -07:00
Anders Kaseorg	b115d44b6a	requirements: Upgrade Python requirements. Signed-off-by: Anders Kaseorg <anders@zulip.com>	2024-06-27 15:31:43 -07:00
Alex Vandiver	b2ebe34500	missedmessage_emails: Backoff the background worker retries.	2024-05-06 12:50:27 -07:00
Tim Abbott	0a756c652c	push_notifications: Shard mobile push notifications.	2024-05-02 14:25:10 -07:00
Alex Vandiver	572fbfe114	queue_processors: Pass the worker_num down into the class.	2024-05-02 14:25:10 -07:00

1 2 3 4 5 ...

503 Commits