isync

lobo/isync

Author	SHA1	Message	Date
Oswald Buddenhagen	b514d9ddbc	purge handling of pending sync entries from state file these cannot actually end up in the committed state. amends `bd5fb6ff`.	2020-08-04 17:16:03 +02:00
Oswald Buddenhagen	d93726067d	wrap jFprintf()+debug() into a macro this ensures that everything that is logged to the journal also appears in the debug output, and it makes the code less noisy.	2020-08-04 17:16:03 +02:00
Oswald Buddenhagen	64e5f07ad3	consistently use NULL for null pointers makes the code noisier, but also somewhat more expressive.	2020-08-04 17:16:01 +02:00
Oswald Buddenhagen	e2d3b4d55b	fix lots of sign conversion warnings ... by making a lot of objects unsigned, and some signed. casts which lose precision and change the sign in one go (ssize_t and time_t to uint on LP64) are made explicit as well.	2020-08-04 17:15:39 +02:00
Oswald Buddenhagen	cc176df2c3	make some narrowing of integers explicit this does specifically not cover about a bazillion warnings about size_t being shrunk to uint - these make no sense given the expected data set size.	2020-08-04 17:14:55 +02:00
Oswald Buddenhagen	4d7e169e57	shrink some data at the source to avoid subsequent narrowing	2020-08-04 17:14:55 +02:00
Oswald Buddenhagen	def22db096	constness fixes add missing const qualifications, and add "const cast" suppressions where unavoidable.	2020-08-04 17:14:55 +02:00
Oswald Buddenhagen	5c2e8d3e14	make more objects static	2020-08-04 17:14:55 +02:00
Oswald Buddenhagen	71d7d3e6df	add some ATTR_* (mostly) mostly ATTR_PRINTFLIKE(*, 0) for functions with a va_list argument. also, one ATTR_NORETURN and one ATTR_UNUSED, both on functions. also, an explicit suppression for a format string stored in a variable.	2020-08-04 17:13:56 +02:00
Oswald Buddenhagen	25b1c2b9e7	set sync record's flags only after propagating new message this is semantically cleaner, and fixes storing the flags in the rare case that flags are not being synced and the target is not being expunged, as in this case flags are queried only during the actual propagation.	2020-08-04 14:49:58 +02:00
Oswald Buddenhagen	abdca388f6	atomize & document conditions in load() exception list construction	2020-08-04 14:49:58 +02:00
Oswald Buddenhagen	b677bfe7e5	de-noise msg_copied() and flags_set() somewhat assign temporary srec object instead of always spelling out the indirection.	2020-08-04 14:49:58 +02:00
Oswald Buddenhagen	841f07efd0	de-noise initialization of sync records use calloc() instead of malloc().	2020-08-04 14:49:58 +02:00
Oswald Buddenhagen	2f3cb5f481	fix signedness issues surrounding UIDs amends `bb632d1c`.	2020-08-04 14:49:57 +02:00
Oswald Buddenhagen	cab14608ca	Merge branch '1.3'	2020-07-08 12:51:20 +02:00
Oswald Buddenhagen	96afe8d0c2	fix propagation of flagged oversized messages ... when not syncing flags and the target is not being expunged, as in that case flags were not queried in time.	2020-07-08 11:14:02 +02:00
Oswald Buddenhagen	e565d08246	don't try to propagate flags the target store does not support $Forwarded is not standard, so it will most likely fail with mailboxes that do not support keywords. amends `c4d7f018`.	2020-01-08 18:22:48 +01:00
Michael J Gruber	c4d7f0189c	implement Forwarded flag maildir supports a 'P' flag which denotes the fact that a message has been 'passed' on (forwarded, bounced). notmuch syncs this to the 'passed' tag. Per https://tools.ietf.org/html/rfc5788, IMAP has a user-defined flag (keyword) '$Forwarded' that is supported by many servers and clients these days. (Technically, one should check for '$Forwarded' in the server response.) Restructure mbsync's flag parser to accept keywords (flags starting with '$') but still bail out on unknown system flags (flags starting with '\'). Support '$Forwarded' as a first keyword since it maps to maildir's 'P' and needs to be sorted in between the system flags. Signed-off-by: Michael J Gruber <github@grubix.eu>	2018-07-01 12:36:28 +02:00
Michael J Gruber	e71f0ccc2a	mark MAILBOX_DRIVER_FLAG locations in code Mailbox driver flags are defined in several places. It is essential that they are kept in sync, so mark them with the same string for easy grepping with an alerting boiler plate. Signed-off-by: Michael J Gruber <github@grubix.eu>	2018-07-01 12:30:59 +02:00
Oswald Buddenhagen	c29eceaeed	make map_name() interpret empty strings as "no separator" empty strings were previously meaningless, and starting with `72c2d695a`, failure to handle them lead to bogus results when the IMAP hierarchy separator is legitimately empty (when the server genuinely supports none and none is manually configured). non-null can be asserted more cleanly than null-or-non-empty, so change the api like that. incidentally, this also removes the need to work around gcc's bogus warning in -Os mode. problem found by "Casper Ti. Vector" <caspervector@gmail.com>	2017-10-15 16:53:27 +02:00
Oswald Buddenhagen	a5d4a0fe60	make sync records with stray TUID non-fatal while the situation indicates an internal error, it is harmless in itself. also, printing some more information may help identify the problem.	2017-10-01 10:42:00 +02:00
Oswald Buddenhagen	bb632d1cd0	make UIDs unsigned complies with the IMAP spec, thus removing the (not really) arbitrary limitation to INT_MAX for UIDs.	2017-04-22 11:26:12 +02:00
Oswald Buddenhagen	a0961d6505	delay assignment of TUID when propagating messages go back to assigning TUIDs only right before actually propagating them. this avoids spurious "TUID lost" warnings.	2017-04-22 11:26:12 +02:00
Oswald Buddenhagen	bd5fb6fff3	move away from magic UIDs in the sync state the only legitimate "deviant" UID is zero, meaning "no message". this can be futher qualified by additional flags in the sync record, rather than using magic values for the UID. in fact, the zero UID (so far meaning only "expunged") was already optionally qualifed with "expired". as a side effect, driver->store_msg() now returns 0 instead of -2 for unknown UIDs. this was a hack to avoid translating the value later on, but it made the api horrible, and now it's superflous in the first place.	2017-04-22 11:26:12 +02:00
Oswald Buddenhagen	4ffe149666	split off ephemeral sync record state to a separate member this allows us to simplify logging of expiration, as we now can just log the entire persistent state instead of fiddling with bits.	2017-04-22 11:26:12 +02:00
Oswald Buddenhagen	efd72b85cc	autotest: implement much more thorough resumption verification the test will now make a test run for every journaled step, both right before and right after the logging.	2017-04-22 11:26:12 +02:00
Oswald Buddenhagen	4cc5ad5a1a	introduce driver call debugging do that by wrapping the actual stores into proxies. the proxy driver's code is auto-generated from function templates, some parameters, and the declarations of the driver functions themselves. attempts to do it with CPP macros turned out to be a nightmare.	2017-04-22 11:26:11 +02:00
Oswald Buddenhagen	bbe4567bce	let driver_t::openbox_box() return the UID validity ... and make 'uidvalidity' private to the drivers.	2017-04-02 17:12:50 +02:00
Oswald Buddenhagen	48ad58b9a3	use a #define for invalid UIDVALIDITY	2017-04-02 17:12:50 +02:00
Oswald Buddenhagen	8d4918affd	introduce get_uidnext() driver callback ... and make 'uidnext' private to the imap driver.	2017-04-02 17:12:50 +02:00
Oswald Buddenhagen	683e581340	let driver_t::find_new_msgs() return the list of messages consistently with driver_t::load_box().	2017-04-02 17:12:50 +02:00
Oswald Buddenhagen	74e9368121	let driver_t::load_box() return the list of messages ... and make 'msgs', 'count', and 'recent' private to the drivers.	2017-04-02 17:12:50 +02:00
Oswald Buddenhagen	c886f71054	make driver_t::prepare_load_box() return the final options ... and make 'opts' private to the drivers.	2017-04-02 17:12:50 +02:00
Oswald Buddenhagen	83ebe9022d	introduce get_box_path() driver callback ... and make 'path' private to the maildir driver.	2017-04-02 17:12:50 +02:00
Oswald Buddenhagen	d624c9af5d	make set_bad_callback() a proper driver_t entry ... and make the pointers private to the drivers.	2017-04-02 17:12:50 +02:00
Oswald Buddenhagen	f46cf8c887	provide a proper getter callback for driver capabilities that way driver_t contains only callbacks.	2017-04-02 17:12:50 +02:00
Oswald Buddenhagen	d54809e268	prepend "get_" to getters in driver_t this makes it callbacks consistently start with a verb.	2017-04-02 17:12:50 +02:00
Oswald Buddenhagen	36666f7e52	rewrite tracking of highest expired UID so far, we tracked the slave side, and calculated the master side on the fly. that complicated things unnecessarily.	2017-04-02 17:12:50 +02:00
Oswald Buddenhagen	677accfd84	streamline syncing of old entries the order of the conditionals was purely historical (pre `4ec56f8cf`, anno 2005) and hard to follow, as were the comments.	2017-04-02 17:12:50 +02:00
Oswald Buddenhagen	990c8a1404	sort uid exception list in a smarter place do it closer to where it is populated. that way the debug output is sorted, and we don't sort the list if it's known to be empty.	2017-04-02 17:12:50 +02:00
Oswald Buddenhagen	34993fbca6	fix sync resumption with aborted entries we need a separate log entry type which does proper mmaxxuid tracking. while moving code around, this also removes a redundant debug statement. amends `b1842617`.	2017-04-02 17:12:50 +02:00
Oswald Buddenhagen	887b2205ff	remove nonsensical statement from journal replay of aborted entries at this stage, entries cannot possibly have messages assigned to them, so trying to unlink them makes no sense. amends `b1842617`.	2017-04-02 17:12:50 +02:00
Oswald Buddenhagen	5c2ce59217	fix sync resumption with re-newed messages the UID of the entries needs to be bumped from -1 to -2, as otherwise the resumed run would see a TUID in a sync entry which may not have one.	2017-04-02 17:12:50 +02:00
Oswald Buddenhagen	7c466fc3e7	don't emit redundant flag updates for re-newed messages	2017-04-02 17:12:50 +02:00
Oswald Buddenhagen	1ea2e69aa7	fix maxuid tracking newmaxuid represents the highest UID for which a sync entry was created, while maxuid represents the end of the range which is guaranteed to have been propagated. that means that the former needs to be instantly incremented (and logged), while the latter must not be touched until the entire new message sync completes. this matters particularly in the case of resuming an interrupted run, where sync entry creation must resume exactly where it left off, while loading the box must use the old limit to ensure that all messages are available for actual propagation.	2017-04-02 17:12:50 +02:00
Oswald Buddenhagen	6705604c4a	de-duplicate journal replay somewhat we've been using indices to separate master/slave state for a long time, so there is no point in using pairs of matching brackets to signify the side in the journal. instead, use somewhat descriptive letters (S[een], F[ind], T[rashed]) and the index itself.	2017-04-02 17:12:50 +02:00
Oswald Buddenhagen	c3350753b0	factor out jFprintf()	2017-04-02 15:24:03 +02:00
Oswald Buddenhagen	1fdf793a3f	fix signedness of 'nex' variables they are derived from srec->status, which is unsigned. for not understood reasons, the compiler complains only after extending status to a full unsigned int. on the way, localize the declarations.	2017-04-02 12:16:57 +02:00
Oswald Buddenhagen	71ced65fcc	Merge remote-tracking branch 'origin/1.2' Conflicts: src/sync.c	2017-04-01 20:31:51 +02:00
Oswald Buddenhagen	f934e995d6	don't populate sync record map with invalid UIDs this would obviously just bloat the hash with nonsense, slowing down the actual lookup later.	2017-03-14 11:36:25 +01:00
Oswald Buddenhagen	77acc26812	implement Message-Id based UIDVALIDITY recovery	2017-01-21 12:09:01 +01:00
Oswald Buddenhagen	f9fe75602e	don't fetch message size unless necessary when syncing flags but not re-newing non-fetched messages, there is no need to query the message size for all messages, as the old ones are queried only for their flags.	2017-01-21 11:41:12 +01:00
Oswald Buddenhagen	1d3b36f89e	factor out app_cr	2017-01-17 22:14:07 +01:00
Oswald Buddenhagen	3dffd68825	factor out copy_msg_convert()	2017-01-17 22:08:49 +01:00
Oswald Buddenhagen	951b7e77f8	factor out copy_msg_bytes()	2017-01-15 13:25:46 +01:00
Oswald Buddenhagen	67f4aeff1f	standardize on 'int' for message sizes that's what the sources already assumed anyway. size_t is total overkill, as No Email Ever (TM) will exceed 2GiB. this also fixes a harmless format string warning in 32 bit builds.	2016-12-29 14:10:35 +01:00
Oswald Buddenhagen	0c36655201	print actually read TUID in debug message	2016-12-26 16:20:27 +01:00
Oswald Buddenhagen	1330f43034	null-terminate lines read from state file & journal makes the subsequent code less convoluted.	2016-12-26 16:20:27 +01:00
Oswald Buddenhagen	4db64967c9	make more use of shifted_bit() technically, this introduces a redundant AND, but the compiler is smart enough to prove that (((A & M) ^ B) & M) == ((A ^ B) & M).	2016-12-18 22:03:51 +01:00
Oswald Buddenhagen	2bba9b903c	wrap message trashing into simple transactions trashing many messages at once inevitably overtaxes m$ exchange, and the connection breaks. without any progress tracking, it would restart from scratch each time, which would lead to a) it never finishing and b) many copies of the messages in the trash. full transactions as we do for "proper" syncing would be over the top, as it's not that bad if some messages get duplicated in the trash. so we record only the messages for which trashing completed, thus allowing some overlap between the attempts.	2016-11-06 09:26:16 +01:00
Oswald Buddenhagen	5b0c8cfa60	use a temporary for sanity	2016-11-05 18:16:43 +01:00
Oswald Buddenhagen	ae95490d52	pre-sort exception list passed to driver->load_box() ... and use that to optimize the maildir driver somewhat.	2016-11-05 17:32:34 +01:00
Oswald Buddenhagen	7b567164ff	abstract growable arrays somewhat ... and sneak in a C99 requirement on the way. just because.	2016-11-05 17:32:34 +01:00
Oswald Buddenhagen	7ddd8d1737	Merge branch 'isync_1_2_branch'	2015-11-08 12:04:44 +01:00
Oswald Buddenhagen	8979ebbdf2	tolerate case changes in X-TUID header name it is legal for an email system to simply change the case of rfc2822 headers, and at least one imap server apparently does just that. this would lead to us not finding our own header, which is obviously not helpful. REFMAIL: CA+fD2U3hJEszmvwBsXEpTsaWgJ2Dh373mCESM3M0kg3ZwAYjaw@mail.gmail.com	2015-09-01 15:40:54 +02:00
Oswald Buddenhagen	549e6739e8	support verbatim and real Maildir++ subfolder naming styles the legacy style is a poorly executed attempt at Maildir++, so introduce the latter for the sake of completeness. but most users will probably just want to use subfolders without any additional dots.	2015-05-01 20:53:23 +02:00
Oswald Buddenhagen	0e1f8f9a3f	revamp console output options - the old meaning of -V[V] was moved to -D{n\|N}, as these are really debugging options. - don't print the info messages by default; this can be re-enabled with the -V switch, and is implied by most debug options (it was really kind of stupid that verbose/debug operation disabled these). - the sync algo/state debugging can be separately enabled with -Ds now.	2015-03-30 10:31:26 +02:00
Oswald Buddenhagen	8aa22a62e7	make progress counters global which means they are now cumulative, and include channels and boxes.	2015-03-30 10:30:35 +02:00
Oswald Buddenhagen	a8b26dc4ac	soft-limit peak memory usage propagating many messages from a fast store (typically maildir or a local IMAP server) to a slow asynchronous store could cause gigabytes of data being buffered. avoid this by throttling fetches if the target context reports memory usage above a configurable limit. REFMAIL: 9737edb14457c71af4ed156c1be0ae59@mpcjanssen.nl	2015-02-15 18:13:05 +01:00
Oswald Buddenhagen	d9a983add6	add support for propagating folder deletions	2015-01-17 17:51:20 +01:00
Oswald Buddenhagen	926788f3ae	supplement open_box() with box existence information from list_store() there is no point in trying to open a non-existing box before trying to create it.	2015-01-11 15:05:29 +01:00
Oswald Buddenhagen	7b7304b625	split create_box() off from open_box() this allows us to do something else than creating missing boxes depending on circumstances. hypothetically, that is.	2015-01-11 15:05:29 +01:00
Oswald Buddenhagen	f1809ddd2b	open the mailboxes after loading the sync state this allows us to react differently to a box'es absence depending on the state. hypothetically, so far.	2015-01-11 15:05:29 +01:00
Oswald Buddenhagen	f43617cd94	lock sync state lazily don't try to lock it until we actually read or write it. the idea is to not fail with SyncState * if we tried to load the state before selecting a non-existing mailbox. this is ok, because if the mailbox is missing, we obviously have no sync state pertaining to it, either. as a side effect, this allows simplifying an error path.	2015-01-11 15:05:29 +01:00
Oswald Buddenhagen	fb19d644f7	split off open_box() from select_box() aka prepare_paths() reloaded. we'll need it in a moment.	2015-01-11 15:05:29 +01:00
Oswald Buddenhagen	97a42cd825	factor out {prepare,lock,save,load}_state()	2015-01-11 15:05:29 +01:00
Oswald Buddenhagen	9982e7bf08	make some driver function names more descriptive	2015-01-11 15:05:29 +01:00
Oswald Buddenhagen	00ebf45be2	rename driver::prepare_opts() => prepare_load() ... and move it to the right place in the structure and fix the doc to not claim that it is called before select().	2015-01-11 15:05:29 +01:00
Oswald Buddenhagen	42cedc8f81	introduce uchar, ushort & uint typedefs	2015-01-11 15:05:28 +01:00
Oswald Buddenhagen	b730f66f7d	Merge branch 'isync_1_1_branch' into HEAD Conflicts: src/socket.c	2015-01-11 14:32:15 +01:00
Oswald Buddenhagen	2fa75cf159	fix UID assignment with some non-UIDPLUS servers the seznam.cz IMAP server seems very eager to send UIDNEXT responses despite not supporting UIDPLUS. this doesn't appear to be a particularly sensible combination, but it's valid nonetheless. however, that means that we need to save the UIDNEXT value before we start storing messages, lest imap_find_new_msgs() will simply overlook them. we do that outside the driver, in an already present field - this actually makes the main path more consistent with the journal recovery path. analysis by Tomas Tintera <trosos@seznam.cz>. REFMAIL: 20141220215032.GA10115@kyvadlo.trosos.seznam.cz	2015-01-11 14:29:19 +01:00
Oswald Buddenhagen	958af473a0	fix conditional for early failure in cancel_done()	2015-01-02 12:38:48 +01:00
Oswald Buddenhagen	f377e7b696	introduce FieldDelimiter and InfoDelimiter options ... for windows fs compatibility. the maildir-specific InfoDelimiter inherits the global FieldDelimiter (which affects SyncState), based on the assumption that if the sync state is on a windows FS, the mailboxes certainly will be as well, while the inverse is not necessarily true (when running on unix, anyway). REFMAIL: <CA+m_8J1ynqAjHRJagvKt9sb31yz047Q7NH-ODRmHOKyfru8vtA@mail.gmail.com>	2014-10-25 17:42:48 +02:00
Oswald Buddenhagen	85fd5ceb54	move orig_name out of store_t it's state specific to the synchronizer.	2014-10-25 15:06:50 +02:00
Oswald Buddenhagen	47897d2403	fix memory management of current mailbox name it was a stupid idea to store the pointer to a variable we need to dispose in a structure which has its own lifetime.	2014-10-04 18:37:34 +02:00
Oswald Buddenhagen	4f383a8074	stop abusing memcmp() memcmp() is unfortunately not guaranteed to read forward byte-by-byte, which means that the clever use as a strncmp() without the pointless strlen()s is not permitted, and can actually misbehave with SSE-optimized string functions. so implement proper equals() and starts_with() functions. as a bonus, the calls are less cryptic.	2014-10-04 18:37:34 +02:00
Oswald Buddenhagen	526231bc22	initialize store_t::name the field is marked foreign (for the drivers), so a recycled store may contain an old pointer in it. that would make our error path crash. REFMAIL: CAF_KswU7aBS7unnK+rdZy1PG_8SZUAW=tcg75HixDLLE0w3Lhw@mail.gmail.com	2014-07-02 08:50:22 +02:00
Oswald Buddenhagen	29b07ca7a6	actually print the faulty mailbox name, not some garbage REFMAIL: CAF_KswU7aBS7unnK+rdZy1PG_8SZUAW=tcg75HixDLLE0w3Lhw@mail.gmail.com	2014-07-02 08:49:47 +02:00
Oswald Buddenhagen	2d4bc1e613	error-check committing of sync state a failure here is rather unlikely, but let's be pedantic. a failure is not fatal (we'll just enter the journal replay path next time), so only print warnings. found by coverity.	2014-04-12 18:31:18 +02:00
Oswald Buddenhagen	aa0118d047	better error messages for sync state and journal related errors we can make perfectly good use of errno here.	2014-04-12 18:30:09 +02:00
Oswald Buddenhagen	c6ddad6ac4	remove pointless/counterproductive "Disk full?" error message suffixes the affected functions will set errno to ENOSPC when necessary.	2014-04-12 18:28:21 +02:00
Oswald Buddenhagen	c5f2943ff6	don't crash in message expiration debug print we would try to print the uids from the non-existing srec of unpaired messages while preparing expiration. this would happen only if a) MaxMessages was configured and b) new messages appeared on the slave but we were not pushing, so it's a bit of a corner case. found by coverity.	2014-04-12 15:28:28 +02:00
Oswald Buddenhagen	6d2fd370a6	fix _POSIX_SYNCHRONIZED_IO usage it can be -1 for unsupported, or 0 for runtime detection (which we don't do).	2014-01-02 21:09:09 +01:00
Oswald Buddenhagen	359091625d	MaxMessages: ignore entries with no master while calculating bulk fetch	2013-12-13 15:38:50 +01:00
Oswald Buddenhagen	2bbd07ec87	adjust comments to new reality	2013-12-11 16:29:34 +01:00
Oswald Buddenhagen	5a21042e98	ensure sequencing of message propagation and store closing by putting the message propagation last, `d3f634702` uncovered a long-standing problem: we might have closed the source store before all messages were propagated from it.	2013-12-11 16:29:33 +01:00
Oswald Buddenhagen	c47ee1c8c4	fix error paths wrt sync drivers, take 3 msgs_copied() was not checked at all, and msgs_flags_set() was doing it wrong (sync_close() was not checked). instead of trying to fix/extend the msgs_flags_set() model (ref-counting and cancelation checking in lower-level functions, and return values to propagate the status), place the refs/derefs around higher-level scopes and do the checking only there. this is effectively simpler, and does away with some obscure macros.	2013-12-11 16:29:33 +01:00
Oswald Buddenhagen	03b3b566f1	reshuffle sources a bit split header and move some code to more logical places.	2013-12-08 23:19:12 +01:00
Oswald Buddenhagen	71524cb6b0	reduce FSync option to a boolean there is no use for Thorough mode any more, so simplify the configuration.	2013-12-08 11:12:09 +01:00
Oswald Buddenhagen	29a56e2dc4	don't fsync after logging every TUID as we now don't actually start propagating new messages until all TUIDs have been generated, it's sufficient to sync just once. this makes it a cheap operation, so we can do it at SYNC_NORMAL level already.	2013-12-08 11:12:09 +01:00

1 2 3 4 5 ...

280 Commits