notmuch

mirror of https://git.notmuchmail.org/git/notmuch synced 2024-11-22 19:08:09 +01:00

Author	SHA1	Message	Date
Daniel Kahn Gillmor	4dfcc8c9b2	crypto: index encrypted parts when indexopts try_decrypt is set. If we see index options that ask us to decrypt when indexing a message, and we encounter an encrypted part, we'll try to descend into it. If we can decrypt, we add the property index.decryption=success. If we can't decrypt (or recognize the encrypted type of mail), we add the property index.decryption=failure. Note that a single message may have both values of the "index.decryption" property: "success" and "failure". For example, consider a message that includes multiple layers of encryption: if we manage to decrypt the outer layer ("index.decryption=success") but fail on the inner layer ("index.decryption=failure"), both values are recorded. Because of the property name, this will be automatically cleared (and possibly re-set) during re-indexing. This means it will subsequently correspond to the actual semantics of the stored index.	2017-10-21 19:53:19 -03:00
Daniel Kahn Gillmor	0bb05ff693	reindex: drop all properties named with prefix "index." This allows us to create new properties that will be automatically set during indexing, and cleared during re-indexing, just by choice of property name.	2017-10-21 19:53:08 -03:00
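The prefix rule described in the commit above can be illustrated with a small sketch. This is not libnotmuch code: the function name `drop_index_properties` and the `session-key` property are hypothetical, and properties are modelled as a plain list of (key, value) pairs so that one key can carry several values.

```python
INDEX_PREFIX = "index."  # the prefix named in the commit message


def drop_index_properties(properties):
    """Return the (key, value) pairs whose key does not start with 'index.'.

    Hypothetical helper: it only models the "clear by name prefix" rule.
    """
    return [(key, value) for key, value in properties
            if not key.startswith(INDEX_PREFIX)]


# A message may hold both values of index.decryption at once.
props = [
    ("index.decryption", "success"),
    ("index.decryption", "failure"),
    ("session-key", "abc"),  # hypothetical non-index property, kept as-is
]
remaining = drop_index_properties(props)
```

After re-indexing, the indexer would be free to set fresh `index.*` properties that reflect the new state of the index.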
Jani Nikula	008a5e92eb	lib: convert notmuch_bool_t to stdbool internally C99 stdbool turned 18 this year. There really is no reason to use our own, except in the library interface for backward compatibility. Convert the lib internally to stdbool.	2017-10-09 22:27:16 -03:00
David Bremner	debfae20db	lib: enforce that n_message_reindex takes headers from first file This is still a bit stopgap to be only choosing one set of headers, but this seems like a more defensible set of headers to choose.	2017-09-05 21:51:57 -03:00
David Bremner	0a40ea4b48	lib: add notmuch_message_has_maildir_flag I considered a higher level interface where the caller passes a tag name rather than a flag character, but the role of the "unread" tag is particularly confusing with such an interface.	2017-08-29 21:56:21 -03:00
David Bremner	8a8fb39b0c	lib/message: split n_m_maildir_flags_tags, store maildir flags In a future commit this will allow querying maildir flags separately from tags to allow resolving certain conflicts.	2017-08-29 21:51:10 -03:00
Daniel Kahn Gillmor	eb232ee0ab	reindex: drop notmuch_param_t, use notmuch_indexopts_t instead There are at least three places in notmuch that can trigger an indexing action: * notmuch new * notmuch insert * notmuch reindex I have plans to add some indexing options (e.g. indexing the cleartext of encrypted parts, external filters, automated property injection) that should properly be available in all places where indexing happens. I also want those indexing options to be exposed by (and constrained by) the libnotmuch C API. This isn't yet an API break because we've never made a release with notmuch_param_t. These indexing options are relevant in the listed places (and in the libnotmuch analogues), but they aren't relevant in the other kinds of functionality that notmuch offers (e.g. dump/restore, tagging, search, show, reply). So i think a generic "param" object isn't well-suited for this case. In particular: * a param object sounds like it could contain parameters for some other (non-indexing) operation. This sounds confusing -- why would i pass non-indexing parameters to a function that only does indexing? * bremner suggests online a generic param object would actually be passed as a list of param objects, argv-style. In this case (at least in the obvious argv implementation), the params might be some sort of generic string. This introduces a problem where the API of the library doesn't grow as new options are added, which means that when code outside the library tries to use a feature, it first has to test for it, and have code to handle it not being available. The indexopts approach proposed here instead makes it clear at compile time and at dynamic link time that there is an explicit dependency on that feature, which allows automated tools to keep track of what's needed and keeps the actual code simple. My proposal adds the notmuch_indexopts_t as an opaque struct, so that we can extend the list of options without causing ABI breakage. The cost of this proposal appears to be that the "boilerplate" API increases a little bit, with a generic constructor and destructor function for the indexopts struct. More patches will follow that make use of this indexopts approach.	2017-08-23 07:55:12 -03:00
Daniel Kahn Gillmor	5b93fa6e70	lib: add notmuch_message_reindex This new function asks the database to reindex a given message. The parameter `indexopts` is currently ignored, but is intended to provide an extensible API to support e.g. changing the encryption or filtering status (e.g. whether and how certain non-plaintext parts are indexed).	2017-08-01 21:17:47 -04:00
David Bremner	34d7753992	lib: add _notmuch_message_remove_indexed_terms Testing will be provided via use in notmuch_message_reindex	2017-08-01 21:17:47 -04:00
David Bremner	8a8e2b11c2	lib: add notmuch_message_count_files This operation is relatively inexpensive, as the needed metadata is already computed by our lazy metadata fetching. The goal is to support better UI for messages with multiple files.	2017-08-01 21:17:47 -04:00
David Bremner	c040464a7c	lib: wrap use of g_mime_utils_header_decode_date This changes return type in gmime 3.0	2017-07-14 21:23:52 -03:00
Jani Nikula	30c475c1ef	build: visibility=default for library structs is no longer needed Commit `d5523ead90` ("Mark some structures in the library interface with visibility=default attribute.") fixed some mixed visibility issues with structs. With the symbol default visibility reversed, this is no longer a problem.	2017-05-13 08:38:18 -03:00
Fredrik Fornwall	e565118172	Replace index(3) with strchr(3) The index(3) function has been deprecated in POSIX since 2001 and removed in 2008, and most code in notmuch already calls strchr(3). This fixes a compilation error on Android whose libc does not have index(3).	2017-04-20 06:59:22 -03:00
David Bremner	5ce8e0b11b	lib: replace deprecated n_q_count_messages with status returning version This function was deprecated in notmuch 0.21. We re-use the name for a status returning version, and deprecate the _st name. One or two remaining uses of the (removed) non-status returning version fixed at the same time	2017-03-22 08:35:07 -03:00
David Bremner	a8a2705222	Merge branch 'release' Merge in memory fixes	2017-03-18 21:02:42 -03:00
Tomi Ollila	06adc27668	lib/message.cc: fix Coverity finding (use after free) The object where pointer to `data` was received was deleted before it was used in _notmuch_string_list_append(). Relevant Coverity messages follow: 3: extract Assigning: data = std::__cxx11::string(message->doc.()).c_str(), which extracts wrapped state from temporary of type std::__cxx11::string. 4: dtor_free The internal representation of temporary of type std::__cxx11::string is freed by its destructor. 5: use after free: Wrapper object use after free (WRAPPER_ESCAPE) Using internal representation of destroyed object local data.	2017-03-18 20:59:46 -03:00
David Bremner	62822a4e2d	lib: clamp return value of g_mime_utils_header_decode_date to >=0 For reasons not completely understood at this time, gmime (as of 2.6.22) is returning a date before 1900 on bad date input. Since this confuses some other software, we clamp such dates to 0, i.e. 1970-01-01.	2017-03-15 21:58:25 -03:00
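The clamping rule in the commit above amounts to a one-line guard. A minimal sketch, assuming the parsed date arrives as a Unix timestamp in seconds; `clamp_date` is a hypothetical name, not a libnotmuch or gmime function.

```python
def clamp_date(timestamp):
    """Clamp a parsed Unix timestamp to >= 0, so bad dates become 1970-01-01."""
    return max(timestamp, 0)
```

A timestamp that falls before the epoch (such as the pre-1900 values mentioned in the commit) is mapped to 0, while ordinary dates pass through unchanged.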
David Bremner	7bd63833bf	lib/message.cc: use view number to invalidate cached metadata Currently the view number is incremented by notmuch_database_reopen	2017-02-25 21:15:38 -04:00
David Bremner	e0b22c139c	lib: handle DatabaseModifiedError in _n_message_ensure_metadata The retries are hardcoded to a small number, and error handling aborts rather than propagating errors from notmuch_database_reopen. These are both somewhat justified by the assumption that most things that can go wrong in Xapian::Database::reopen are rare and fatal. Here's the brief discussion with Xapian upstream: 24-02-2017 08:12:57 < bremner> any intuition about how likely Xapian::Database::reopen is to fail? I'm catching a DatabaseModifiedError somewhere where handling any further errors is tricky, and wondering about treating a failed reopen as "the impossible happened, stopping" 24-02-2017 16:22:34 < olly> bremner: there should not be much scope for failure - stuff like out of memory or disk errors, which are probably a good enough excuse to stop	2017-02-25 21:13:50 -04:00
David Bremner	884dccf293	lib: make _notmuch_message_ensure_property_map static It's not called outside message.cc	2017-02-23 08:54:36 -04:00
David Bremner	3db9e94b0e	lib: make _notmuch_message_ensure_metadata static It's not called anywhere outside message.cc.	2017-02-23 08:54:25 -04:00
David Bremner	b8bb6d7964	lib: basic message-property API Initially, support get, set and removal of single key/value pair, as well as removing all properties.	2016-09-21 18:14:24 -03:00
David Bremner	4dfb69169e	lib: read "property" terms from messages. This is a first step towards providing an API to attach arbitrary (key,value) pairs to messages and retrieve all of the values for a given key.	2016-09-21 18:14:24 -03:00
Daniel Kahn Gillmor	6a833a6e83	Use https instead of http where possible Many of the external links found in the notmuch source can be resolved using https instead of http. This changeset addresses as many as i could find, without touching the e-mail corpus or expected outputs found in tests.	2016-06-05 08:32:17 -03:00
Tomi Ollila	cf09631a45	lib: whitespace cleanup Cleaned the following whitespace in lib/* files: lib/index.cc: 1 line: trailing whitespace lib/database.cc 5 lines: 8 spaces at the beginning of line lib/notmuch-private.h: 4 lines: 8 spaces at the beginning of line lib/message.cc: 1 line: trailing whitespace lib/sha1.c: 1 line: empty lines at the end of file lib/query.cc: 2 lines: 8 spaces at the beginning of line lib/gen-version-script.sh: 1 line: trailing whitespace	2016-06-05 08:23:28 -03:00
Daniel Kahn Gillmor	e366bb2227	complete ghost-on-removal-when-shared-thread-exists To fully complete the ghost-on-removal-when-shared-thread-exists proposal, we need to clear all ghost messages when the last active message is removed from a thread. Amended by db: Remove the last test of T530, as it no longer makes sense if we are garbage collecting ghost messages.	2016-04-15 07:13:49 -03:00
Daniel Kahn Gillmor	1695415039	On deletion, replace with ghost when other active messages in thread There is no need to add a ghost message upon deletion if there are no other active messages in the thread. Also, if the message being deleted was a ghost already, we can just go ahead and delete it.	2016-04-15 07:07:23 -03:00
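The removal rules from the two ghost-on-removal commits above can be modelled in a few lines. This is only a sketch of the decision logic as the commit messages describe it: a thread is represented as a plain dict from message id to `"active"` or `"ghost"`, and `remove_message` is a hypothetical name rather than the library's function.

```python
def remove_message(thread, message_id):
    """Remove message_id from thread (dict: id -> 'active' or 'ghost'), in place."""
    was_ghost = thread[message_id] == "ghost"
    del thread[message_id]
    if was_ghost:
        return  # a ghost is simply deleted, no replacement needed
    if any(state == "active" for state in thread.values()):
        # Other active messages remain: leave a ghost so the thread stays linked.
        thread[message_id] = "ghost"
    else:
        # The last active message is gone: garbage-collect remaining ghosts.
        thread.clear()


thread = {"a": "active", "b": "active"}
remove_message(thread, "a")   # "a" becomes a ghost because "b" is still active
after_first = dict(thread)
remove_message(thread, "b")   # last active message: the ghost of "a" goes too
```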
Daniel Kahn Gillmor	9eebae3da4	Introduce _notmuch_message_has_term() It can be useful to easily tell if a given message has a given term associated with it.	2016-04-15 07:07:23 -03:00
Daniel Kahn Gillmor	604d1e0977	fix thread breakage via ghost-on-removal implement ghost-on-removal, the solution to T590-thread-breakage.sh that just adds a ghost message after removing each message. It leaks information about whether we've ever seen a given message id, but it's a fairly simple implementation. Note that _resolve_message_id_to_thread_id already introduces new message_ids to the database, so i think just searching for a given message ID may introduce the same metadata leakage.	2016-04-15 07:07:23 -03:00
Daniel Kahn Gillmor	07b6220a55	clean up stray apostrophe in comment This is a nit-picky orthographical fix for a nit-picky ontological comment.	2016-01-16 08:17:15 -04:00
Daniel Kahn Gillmor	e038b95ffe	correct comment referring to notmuch_database_remove_message notmuch_database_remove_message has no leading underscore in its name.	2016-01-16 08:16:51 -04:00
Austin Clements	7f57b747b9	lib: Add per-message last modification tracking This adds a new document value that stores the revision of the last modification to message metadata, where the revision number increases monotonically with each database commit. An alternative would be to store the wall-clock time of the last modification of each message. In principle this is simpler and has the advantage that any process can determine the current timestamp without support from libnotmuch. However, even assuming a computer's clock never goes backward and ignoring clock skew in networked environments, this has a fatal flaw. Xapian uses (optimistic) snapshot isolation, which means reads can be concurrent with writes. Given this, consider the following time line with a write and two read transactions: write |-X-A--------------| read 1 |---B---| read 2 |---| The write transaction modifies message X and records the wall-clock time of the modification at A. The writer hangs around for a while and later commits its change. Read 1 is concurrent with the write, so it doesn't see the change to X. It does some query and records the wall-clock time of its results at B. Transaction read 2 later starts after the write commits and queries for changes since wall-clock time B (say the reads are performing an incremental backup). Even though read 1 could not see the change to X, read 2 is told (correctly) that X has not changed since B, the time of the last read. In fact, X changed before wall-clock time A, but the change was not visible until after wall-clock time B, so read 2 misses the change to X. This is tricky to solve in full-blown snapshot isolation, but because Xapian serializes writes, we can use a simple, monotonically increasing database revision number. Furthermore, maintaining this revision number requires no more IO than a wall-clock time solution because Xapian already maintains statistics on the upper (and lower) bound of each value stream.	2015-08-13 23:52:51 +02:00
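The revision-number idea in the commit above can be sketched with a toy in-memory model. Nothing here is libnotmuch or Xapian API: the `Database` class, `commit`, and `changed_since` are hypothetical names, and the only property being illustrated is that the revision is assigned at commit time, with commits serialized.

```python
class Database:
    """Toy model: one monotonically increasing revision per commit."""

    def __init__(self):
        self.revision = 0
        self.lastmod = {}  # message id -> revision of its last modification

    def commit(self, modified_ids):
        """Commit one write transaction; writes are assumed to be serialized."""
        self.revision += 1
        for message_id in modified_ids:
            self.lastmod[message_id] = self.revision
        return self.revision

    def changed_since(self, revision):
        """Message ids whose last modification is newer than `revision`."""
        return sorted(m for m, r in self.lastmod.items() if r > revision)


db = Database()
db.commit(["W"])
seen = db.revision        # "read 1" records the revision it observed
db.commit(["X"])          # the slow writer commits after read 1 took its snapshot
missed = db.changed_since(seen)
```

Because the stamp is taken when the change becomes visible rather than when it was made, an incremental reader that remembers `seen` cannot skip the change to X, which is the failure mode of the wall-clock scheme described in the commit.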
Austin Clements	e6ad3a5dd4	lib: Only sync modified message documents Previously, we updated the database copy of a message on every call to _notmuch_message_sync, even if nothing had changed. In particular, this always happens on a thaw, so a freeze/thaw pair with no modifications between still caused a database update. We only modify message documents in a handful of places, so keep track of whether the document has been modified and only sync it when necessary. This will be particularly important when we add message revision tracking.	2015-08-04 08:54:46 +02:00
David Bremner	9d192da683	lib: eliminate fprintf from _notmuch_message_file_open You may wonder why _notmuch_message_file_open_ctx has two parameters. This is because we sometimes need to use a ctx which is a notmuch_message_t. While we could get the database from this, there is no easy way in C to tell what type we are getting.	2015-03-29 00:34:15 +01:00
David Bremner	736ac26407	lib: replace almost all fprintfs in library with _n_d_log This is not supposed to change any functionality from an end user point of view. Note that it will eliminate some output to stderr. The query debugging output is left as is; it doesn't really fit with the current primitive logging model. The remaining "bad" fprintf will need an internal API change.	2015-03-29 00:34:15 +01:00
David Bremner	9b73a8bcc9	lib: add private function to extract the database for a message. This is needed by logging in functions outside message.cc that take only a notmuch_message_t object.	2015-03-29 00:34:15 +01:00
David Bremner	105537a809	lib: convert two "iterator copy strings" into references. Apparently this is a supported and even idiomatic way of keeping a temporary object (e.g. like that returned from an operator dereference) alive.	2015-01-02 17:18:42 +01:00
David Bremner	3d978a0d61	lib: another iterator-temporary/stale-pointer bug Tamas Szakaly points out [1] that the bug fixed in `51b073c` still exists in at least one place. This change follows the suggestion of [2] and creates a block scope temporary std::string to avoid the rules of iterators temporaries. [1]: id:20141226113755.GA64154@pamparam [2]: id:20141226230655.GA41992@pamparam	2015-01-02 17:10:37 +01:00
Austin Clements	bc9c50602d	lib: Internal support for querying and creating ghost messages This updates the message abstraction to support ghost messages: it adds a message flag that distinguishes regular messages from ghost messages, and an internal function for initializing a newly created (blank) message as a ghost message.	2014-10-25 19:26:54 +02:00
Austin Clements	d99491f274	lib: Introduce macros for bit operations These macros help clarify basic bit-twiddling code and are written to be robust against C undefined behavior of shift operators.	2014-10-25 19:26:43 +02:00
Austin Clements	7487e2e221	lib: Handle empty date value In the interest of robustness, avoid undefined behavior of sortable_unserialise if the date value is missing. This shouldn't happen now, but ghost messages will have blank date values.	2014-10-11 07:10:12 +02:00
Austin Clements	54ec8a0fd8	lib: Move message ID compression to _notmuch_message_create_for_message_id Previously, this was performed by notmuch_database_add_message. This happens to be the only caller currently (which is why this was safe), but we're about to introduce more callers, and it makes more sense to put responsibility for ID compression in the lower-level function rather than requiring each caller to handle it.	2014-10-11 07:09:54 +02:00
Jani Nikula	f42e2e43a0	lib: actually return failures from notmuch_message_tags_to_maildir_flags The function takes great care to preserve the first error status it encounters, yet fails to return that status to the caller. Fix it.	2014-09-24 20:19:34 +02:00
Austin Clements	ec573cd54f	lib: Return an error from operations that require an upgrade Previously, there was no protection against a caller invoking an operation on an old database version that would effectively corrupt the database by treating it like a newer version. According to notmuch.h, any caller that opens the database in read/write mode is supposed to check if the database needs upgrading and perform an upgrade if it does. This would protect against this, but nobody (even the CLI) actually does this. However, with features, it's easy to protect against incompatible operations on a fine-grained basis. This lightweight change allows callers to safely operate on old database versions, while preventing specific operations that would corrupt the database with an informative error message.	2014-08-30 11:39:41 -07:00
Austin Clements	5dbfed4a73	lib: Support empty header values in database Commit `567bcbc2` introduced support for storing various headers in document values. However, doing so in a backwards-compatible way meant that genuinely empty header values could not be distinguished from the old behavior of not storing the headers at all, so these required parsing the original message. Now that we have database features, new databases can declare that all messages have header values, so if we have this feature flag, we can use the stored header value even if it's the empty string. This requires slight cleanup to notmuch_message_get_header, since the code previously couldn't distinguish between empty headers and headers that are never stored in the database (previously this distinction didn't matter).	2014-08-30 11:37:33 -07:00
Austin Clements	0c1292051e	lib: Improve documentation of _notmuch_message_create_for_message_id Clarify the state of the returned message when _notmuch_message_create_for_message_id returns NOTMUCH_PRIVATE_STATUS_NO_DOCUMENT_FOUND.	2014-08-05 08:14:15 -03:00
Austin Clements	30de720ba0	lib: Invalidate message metadata in _notmuch_message_gen_terms Previously, we invalidated stored message metadata in _notmuch_message_add_term and _notmuch_message_remove_term, but not in _notmuch_message_gen_terms. This doesn't currently result in any bugs because of our limited uses of _notmuch_message_gen_terms, but it could cause trouble in the future.	2014-08-04 18:57:55 -03:00
Charles Celerier	df8885f62c	lib: Start all function names in notmuch-private.h with _notmuch As noted in devel/STYLE, every private library function should start with _notmuch. This patch corrects function naming that did not adhere to this style in lib/notmuch-private.h. In particular, the old function names that now begin with _notmuch are notmuch_sha1_of_file notmuch_sha1_of_string notmuch_message_file_close notmuch_message_file_get_header notmuch_message_file_open notmuch_message_get_author notmuch_message_set_author Signed-off-by: Charles Celerier <cceleri@cs.stanford.edu>	2014-07-13 12:25:29 -03:00
Austin Clements	dc64ab6720	lib: Separate all phrases indexed by _notmuch_message_gen_terms This adds a 100 termpos gap between all phrases indexed by _notmuch_message_gen_terms. This fixes a bug where terms from the end of one header and the beginning of another header could match together in a single phrase and a separate bug where term positions of un-prefixed terms overlapped. This fix only affects newly indexed messages. Messages that are already indexed won't benefit from this fix without re-indexing, but the fix won't make things any worse for existing messages.	2014-06-18 18:03:18 -03:00
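Why a position gap prevents cross-header phrase matches can be shown with a toy positional index. This is a sketch under stated assumptions, not Xapian code: `index_phrases` and `phrase_matches` are hypothetical helpers, terms are split on whitespace, and a phrase matches only when its terms sit at consecutive positions. The gap of 100 is the value given in the commit.

```python
PHRASE_GAP = 100  # gap named in the commit message


def index_phrases(phrases, gap=PHRASE_GAP):
    """Assign a position to every term, leaving `gap` free slots between phrases."""
    postings = []  # list of (term, position)
    position = 0
    for phrase in phrases:
        for term in phrase.split():
            position += 1
            postings.append((term, position))
        position += gap
    return postings


def phrase_matches(postings, query):
    """True when the query terms occur at consecutive positions."""
    terms = query.split()
    if not terms:
        return False
    where = {}
    for term, position in postings:
        where.setdefault(term, set()).add(position)
    return any(
        all(start + offset in where.get(term, ()) for offset, term in enumerate(terms))
        for start in where.get(terms[0], ())
    )


headers = ["hello world", "foo bar"]
with_gap = index_phrases(headers)
without_gap = index_phrases(headers, gap=0)
```

With no gap, the query `world foo` matches across the boundary between the two indexed strings; with the gap it does not, while `hello world` still matches in both cases.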
Jani Nikula	1fa8e40561	lib: make folder: prefix literal In xapian terms, convert folder: prefix from probabilistic to boolean prefix, matching the paths, relative from the maildir root, of the message files, ignoring the maildir new and cur leaf directories. folder:foo matches all message files in foo, foo/new, and foo/cur. folder:foo/new does not match message files in foo/new. folder:"" matches all message files in the top level maildir and its new and cur subdirectories. This change constitutes a database change: bump the database version and add database upgrade support for folder: terms. The upgrade also adds path: terms. Finally, fix the folder search test for literal folder: search, as some of the folder: matching capabilities are lost in the probabilistic to boolean prefix change.	2014-03-11 19:51:22 -03:00
Jani Nikula	59823f9642	lib: add support for path: prefix searches The path: prefix is a literal boolean prefix matching the paths, relative from the maildir root, of the message files. path:foo matches all message files in foo (but not in foo/new or foo/cur). path:foo/new matches all message files in foo/new. path:"" matches all message files in the top level maildir. path:foo/** matches all message files in foo and recursively in all subdirectories of foo. path:** matches all message files recursively, i.e. all messages.	2014-03-11 19:51:22 -03:00
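The matching rules of the folder: and path: commits above can be restated as two small predicates. This is a model of the described semantics only, not of how notmuch stores or queries terms: `path_matches`, `folder_of`, and `folder_matches` are hypothetical names, filenames are assumed to be relative to the maildir root, and the recursive form is assumed to be spelled with a trailing `**` as in notmuch's search-terms documentation.

```python
import posixpath


def path_matches(query, filename):
    """path: semantics: exact directory match, or recursive with a trailing '**'."""
    directory = posixpath.dirname(filename)
    if query == "**":
        return True
    if query.endswith("/**"):
        base = query[:-3]
        return directory == base or directory.startswith(base + "/")
    return directory == query


def folder_of(filename):
    """Directory of the file with a trailing new/ or cur/ leaf stripped."""
    directory = posixpath.dirname(filename)
    head, leaf = posixpath.split(directory)
    return head if leaf in ("new", "cur") else directory


def folder_matches(query, filename):
    """folder: semantics: literal match that ignores the new and cur leaves."""
    return folder_of(filename) == query
```

For instance, `folder_matches("foo", "foo/new/msg1")` holds while `path_matches("foo", "foo/new/msg1")` does not, which mirrors the difference between the two prefixes described in the commits.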
Jani Nikula	4d150eba67	lib: refactor folder term update after filename removal Abstract some blocks of code for reuse. No functional changes.	2014-03-11 19:51:22 -03:00
Tomi Valkeinen	075d53dde5	lib: fix error handling Currently if a Xapian exception happens in notmuch_message_get_header, the exception is not caught leading to crash. In notmuch_message_get_date the exception is caught, but an internal error is raised, again leading to crash. This patch fixes the error handling by making both functions catch the Xapian exceptions, print an error and return NULL or 0. The 'notmuch->exception_reported' is also set, as is done elsewhere, even if I don't really get the idea of that field. Signed-off-by: Tomi Valkeinen <tomi.valkeinen@iki.fi>	2014-01-18 14:47:35 -04:00
Louis Rilling	a9b2135c75	tags_to_maildir_flags: Don't rename if no flags change notmuch_message_tags_to_maildir_flags() unconditionally moves messages from maildir directory "new/" to maildir directory "cur/", which makes messages lose their "new" status in the MUA. However some users want to keep this "new" status after, for instance, an auto-tagging of new messages. However, as Austin mentioned and according to the maildir specification, messages living in "new/" are not allowed to have flags, even if mutt allows it to happen. For this reason, this patch prevents moving messages from "new/" to "cur/", only if no flags have to be changed. It's hopefully enough to satisfy mutt (and maybe other MUAs showing the "new" status) users checking the "new" status. Changelog: * v2: Fix bool type as well as NULL returned despite having no errors (Austin Clements) * v4: Tag the related test (contributed by Michal Sojka) as working Signed-off-by: Louis Rilling <l.rilling@av7.net> [Condition for keeping messages in new/ was extended to satisfy all tests from the previous patch. -Michal Sojka] [Added by David Bremner, to keep the tests passing at each commit] update insert tests for new maildir synchronization rules As of id:1355952747-27350-4-git-send-email-sojkam1@fel.cvut.cz we are more conservative about moving messages from ./new to ./cur. This updates the insert tests to match	2013-09-03 20:41:51 -03:00
Vladimir Marek	51b073c6f2	lib/message.cc: stale pointer bug (v3) Xapian::TermIterator::operator* returns std::string which is destroyed as soon as (*i).c_str() finishes. The remembered pointer 'term' then references invalid memory. Signed-off-by: Vladimir Marek <vlmarek@volny.cz>	2013-05-03 21:17:56 -03:00
Austin Clements	5394924e6c	lib: Separate list of all messages from top-level messages Previously, thread.cc built up a list of all messages, then proceeded to tear it apart to transform it into a list of top-level messages. Now we simply build a new list of top-level messages. This simplifies the interface to _notmuch_message_add_reply, eliminates the pointer acrobatics from _resolve_thread_relationships, and will enable us to do things with the list of all messages in the following patches.	2013-02-18 20:20:24 -04:00
Jani Nikula	5505d55515	lib: fix warnings when building with clang Building notmuch with CC=clang and CXX=clang++ produces the warnings: CC -O2 lib/tags.o lib/tags.c:43:5: warning: expression result unused [-Wunused-value] talloc_steal (tags, list); ^~~~~~~~~~~~~~~~~~~~~~~~~ /usr/include/talloc.h:345:143: note: expanded from: ...__location__); __talloc_steal_ret; }) ^~~~~~~~~~~~~~~~~~ 1 warning generated. CXX -O2 lib/message.o lib/message.cc:791:5: warning: expression result unused [-Wunused-value] talloc_reference (message, message->tag_list); ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/include/talloc.h:932:36: note: expanded from: ...(_TALLOC_TYPEOF(ptr))_talloc_reference_loc((ctx),(ptr), __location__) ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1 warning generated. Check talloc_reference() return value, and explicitly ignore talloc_steal() return value as it has no failure modes, to silence the warnings.	2012-12-01 08:10:32 -04:00
Austin Clements	b88030bda6	lib: Treat messages in new/ as maildir messages with no flags set Previously, notmuch new only synchronized maildir flags to tags for files with a maildir "info" part. Since messages in new/ don't have an info part, notmuch would ignore them for flag-to-tag synchronization. This patch makes notmuch consider messages in new/ to be legitimate maildir messages that simply have no maildir flags set. The most visible effect of this is that such messages now automatically get the unread tag.	2012-06-10 20:14:56 -03:00
Austin Clements	750231bae8	lib: Only synchronize maildir flags for messages in maildirs Previously, we synchronized flags to tags for any message that looked like it had maildir flags in its file name, regardless of whether it was in a maildir-like directory structure. This was asymmetric with tag-to-flag synchronization, which only applied to messages in directories named new/ and cur/ (introduced by `95dd5fe5`). This change makes our interpretation stricter and addresses this asymmetry by only synchronizing flags to tags for messages in directories named new/ or cur/. It also prepares us to treat messages in new/ as maildir messages, even though they lack maildir flags.	2012-06-10 20:13:58 -03:00
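The two maildir commits above describe when flags are read from a file name. A rough sketch of that rule follows, with several assumptions: `maildir_flags` and `is_unread` are hypothetical names, the info part is taken to be whatever follows `:2,` in the base name, and a file in cur/ without an info part is treated as having nothing to synchronize. The mapping of the maildir `S` (seen) flag to the absence of the unread tag is the standard one.

```python
import posixpath


def maildir_flags(filename):
    """Return the maildir flag string, or None if no flag sync should happen."""
    directory, basename = posixpath.split(filename)
    leaf = posixpath.basename(directory)
    if leaf == "new":
        return ""       # in new/: a maildir message with no flags set
    if leaf != "cur":
        return None     # not inside a maildir: ignore flag-like file names
    marker = basename.rfind(":2,")
    if marker == -1:
        return None     # assumption: no info part, nothing to synchronize
    return basename[marker + 3:]


def is_unread(flags):
    """A message is unread unless the 'S' (seen) flag is present."""
    return "S" not in flags
```

Under this model a file such as `inbox/new/123.host` yields the empty flag string and is therefore tagged unread, while `archive/123.host:2,S` is skipped because it does not live in a new/ or cur/ directory.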
Austin Clements	93ab4c7d11	lib: Move _filename_is_in_maildir This way notmuch_message_maildir_flags_to_tags can call it. It makes more sense for this to be just above all of the maildir synchronization code rather than mixed in the middle.	2012-06-10 20:13:45 -03:00
Austin Clements	d9f61c26a1	lib: Don't needlessly create directory docs in _notmuch_message_remove_filename Previously, if passed a filename with a directory that did not exist in the database, _notmuch_message_remove_filename would needlessly create that directory document. Fix it so that doesn't happen.	2012-05-23 22:32:12 -03:00
Austin Clements	67ae2377a9	lib: Perform the same transformation to _notmuch_database_filename_to_direntry Now _notmuch_database_filename_to_direntry takes a flags argument and can indicate if the necessary directory documents do not exist. Again, callers have been updated, but retain their original behavior.	2012-05-23 22:30:43 -03:00
Louis Rilling	b9360be2bd	tags_to_maildir_flags: Cleanup double assignment The for loop right after already does the job. Signed-off-by: Louis Rilling <l.rilling@av7.net>	2011-11-21 20:32:32 -04:00
Louis Rilling	21b13c3932	lib: Kill last usage of C++ type bool Signed-off-by: Louis Rilling <l.rilling@av7.net>	2011-11-21 20:32:07 -04:00
Austin Clements	567bcbc294	Store "from" and "subject" headers in the database. This is a rebase and cleanup of Istvan Marko's patch from id:m3pqnj2j7a.fsf@zsu.kismala.com Search retrieves these headers for every message in the search results. Previously, this required opening and parsing every message file. Storing them directly in the database significantly reduces IO and computation, speeding up search by between 50% and 10X. Taking full advantage of this requires a database rebuild, but it will fall back to the old behavior for messages that do not have headers stored in the database.	2011-11-14 17:10:58 -04:00
Ali Polatel	02a3076711	lib: make find_message{,_by_filename} report errors Previously, the functions notmuch_database_find_message() and notmuch_database_find_message_by_filename() did not properly report error conditions to the library user. For more information, read the thread on the notmuch mailing list starting with my mail "id:871uv2unfd.fsf@gmail.com" Make these functions accept a pointer to 'notmuch_message_t' as argument and return notmuch_status_t which may be used to check for any error condition. restore: Modify for the new notmuch_database_find_message() new: Modify for the new notmuch_database_find_message_by_filename()	2011-10-04 07:55:29 +03:00
Austin Clements	bfe4555325	lib: Remove message document directly after removing the last file name. Previously, notmuch_database_remove_message would remove the message file name, sync the change to the message document, re-find the message document, and then delete it if there were no more file names. An interruption after sync'ing would result in a file-name-less, permanently un-removable zombie message that would produce errors and odd results in searches. We could wrap this in an atomic section, but it's much simpler to eliminate the round-about approach and just delete the message document instead of sync'ing it if we removed the last filename.	2011-09-23 21:50:39 -04:00
Austin Clements	e4379c43e2	lib: Indicate if there are more filenames after removal. Make _notmuch_message_remove_filename return NOTMUCH_STATUS_DUPLICATE_MESSAGE_ID if the message has more filenames and fix callers to handle this.	2011-09-23 21:50:39 -04:00
Austin Clements	62445dd023	lib: Add missing status check in _notmuch_message_remove_filename. Previously, this function would synchronize the folder list even if removing the file name failed. Now it returns immediately if removing the file name fails.	2011-09-12 23:36:00 -03:00
Mark Anderson	8a856e5c38	Fix folder: coherence issue Add removal of all ZXFOLDER terms to removal of all XFOLDER terms for each message filename removal. The existing filename-list reindexing will put all the needed terms back in. Test search-folder-coherence now passes. Signed-off-by: Mark Anderson <ma.skies@gmail.com>	2011-06-29 14:13:16 -07:00
Pieter Praet	8bb6f7869c	fix sum moar typos [comments in source code] Various typo fixes in comments within the source code. Signed-off-by: Pieter Praet <pieter@praet.org> Edited-by: Carl Worth <cworth@cworth.org> Restricted to just source-code comments, (and fixed fix of "descriptios" to "descriptors" rather than "descriptions").	2011-06-23 15:58:39 -07:00
Carl Worth	d5523ead90	Mark some structures in the library interface with visibility=default attribute. As of gcc 4.6, there are new warnings from -Wattributes along the lines of: warning: ‘_notmuch_messages’ declared with greater visibility than the type of its field ‘_notmuch_messages::iterator’ [-Wattributes] To squelch these, we decorate all such containing structs with __attribute__((visibility("default"))). We take care to let only the C++ compiler see this, (since the C compiler would otherwise warn about ignored visibility attributes on types).	2011-05-11 13:27:15 -07:00
Carl Worth	2f3a76c569	Remove some variables which were set but not used. gcc (at least as of version 4.6.0) is kind enough to point these out to us, (when given -Wunused-but-set-variable explicitly or implicitly via -Wunused or -Wall). One of these cases was a legitimately unused variable. Two were simply variables (named ignored) we were assigning only to squelch a warning about unused function return values. I don't seem to be getting those warnings even without setting the ignored variable. And the gcc docs. say that the correct way to squelch that warning is with a cast to (void) anyway.	2011-05-11 13:27:14 -07:00
Austin Clements	d19c5de17a	Add the tag list to the unified message metadata pass. Now each caller of notmuch_message_get_tags only gets a new iterator, instead of a whole new list. In principle this could cause problems with iterating while modifying tags, but through the magic of talloc references, we keep the old tag list alive even after the cache in the message object is invalidated. This reduces my inbox search from the 3.102 seconds before the unified metadata pass to 1.811 seconds (1.7X faster). Combined with the thread search optimization in `b3caef1f06`, that makes this query 2.5X faster than when I started.	2011-03-21 02:45:18 -04:00
Austin Clements	f271071330	Add the file name list to the unified message metadata pass. Even if the caller never uses the file names, there is little cost to simply fetching the file name terms. However, retrieving the full paths requires additional database work, so the expansion from terms to full paths is performed lazily. This also simplifies clearing the filename cache, since that's now handled by the generic metadata cache code. This further reduces my inbox search from 3.102 seconds before the unified metadata pass to 2.206 seconds (1.4X faster).	2011-03-21 02:45:18 -04:00
Austin Clements	206938ec9b	Add a generic function to get a list of terms with some prefix. Replace _notmuch_convert_tags with this and simplify _create_filenames_for_terms_with_prefix. This will also come in handy shortly to get the message file name list.	2011-03-21 02:45:18 -04:00
Austin Clements	f3c1eebfaf	Implement an internal generic string list and use it. This replaces the guts of the filename list and tag list, making those interfaces simple iterators over the generic string list. The directory, message filename, and tags-related code now build generic string lists and then wrap them in specific iterators. The real wins come in later patches, when we use these for even more generic functionality. As a nice side-effect, this also eliminates the annoying dependency on GList in the tag list.	2011-03-21 02:45:18 -04:00
Austin Clements	d9b0ae918f	Use a single unified pass to fetch scalar message metadata. This performs a single pass over a message's term list to fetch the thread ID, message ID, and reply-to, rather than requiring a pass for each. Xapian decompresses the term list anew for each iteration, so this reduces the amount of time spent decompressing message metadata. This reduces my inbox search from 3.102 seconds to 2.555 seconds (1.2X faster).	2011-03-21 02:45:18 -04:00
Carl Worth	db70f3f0c4	lib: Save and restore term position in message while indexing. This fixes the recently added search-position-overlap bug as demonstrated in the test of the same name.	2011-01-26 15:59:19 +10:00
Carl Worth	99cfa27030	Add support for folder-based searching. A new "folder:" prefix in the query string can now be used to match the directories in which mail files are stored. The addition of this feature causes the recently added search-by-folder tests to now pass.	2011-01-15 15:37:43 -08:00
Carl Worth	36161181df	Correct some minor typos in a comment Nothing too important here. Just some misspellings I noticed while reading nearby code.	2011-01-15 15:37:43 -08:00
Austin Clements	b3caef1f06	Optimize thread search using matched docid sets. This reduces thread search's 1+2t Xapian queries (where t is the number of matched threads) to 1+t queries and constructs exactly one notmuch_message_t for each message instead of 2 to 3. notmuch_query_search_threads eagerly fetches the docids of all messages matching the user query instead of lazily constructing message objects and fetching thread ID's from term lists. _notmuch_thread_create takes a seed docid and the set of all matched docids and uses a single Xapian query to expand this docid to its containing thread, using the matched docid set to determine which messages in the thread match the user query instead of using a second Xapian query. This reduces the amount of time required to load my inbox from 4.523 seconds to 3.025 seconds (1.5X faster).	2010-12-07 16:40:05 -08:00
Carl Worth	7278383005	lib: Fix missing initialization of status field. This could have been a problematic bug. Fortunately "gcc -O2" warns about it.	2010-11-11 20:54:41 -08:00
Carl Worth	fe8eeaf4a5	lib: Add two missing static qualifiers The debian packaging is nice enough to notice when we accidentally leak private symbols to the public interface.	2010-11-11 20:53:21 -08:00
Carl Worth	96d99c3837	tags_to_maildir_flags: Fix to preserve existing, unsupported flags This is to prevent notmuch from destroying any information the user has encoded as flags in the maildir filename. Tests are also added to the test suite to verify the documented behavior.	2010-11-11 16:36:02 -08:00
Carl Worth	95dd5fe5d7	notmuch_message_tags_to_maildir_flags: Do nothing outside of "new" and "cur" Some people use notmuch with non-maildir files, (for example, email messages in MH format, or else cool things like using sluk[] to suck down feeds into a format that notmuch can index). To better support uses like that, don't do any renaming for files that are not in a directory named either "new" or "cur". [] https://github.com/krl/sluk/	2010-11-11 14:32:17 -08:00
Carl Worth	37a8096fdc	notmuch_message_tags_to_maildir_flags: Don't exit on failure to rename. It is totally legitimate for a non-maildir directory to be named "new" (and not have a directory next to it named "cur"). To support this case at least, be silent about any rename failure.	2010-11-11 03:50:42 -08:00
Carl Worth	71a3201885	notmuch_message_tags_to_maildir_flags: Fix to rename multiple files This function was documented as modifying every filename associated with the message. Fix it to actually do that.	2010-11-11 03:47:11 -08:00
Carl Worth	404db1de90	maildir_flags_to_tags: Avoid interpreting "no info" as "no flags set". If a filename has no maildir info at all, (that is, it does not contain the sequence ":2,"), we consider this distinct from a filename with an empty maildir info, (the ":2," separator is present, but no flags characters follow). Specifically, we regard a missing info field as providing no information, so tags will remain unchanged. On the other hand, an info field that is present but has no flags set will cause various tags to be cleared, (or in the case of "unread", added). This fixes the "remove info" case of the maildir-sync tests in the test suite.	2010-11-11 03:40:19 -08:00
Carl Worth	81cbaafc0f	Fix notmuch_message_tags_to_maildir_flags to effect rename immediately We have tests to ensure that when the notmuch library renames a file that that rename takes place immediately in the database, (without requiring something like "notmuch new" to notice the change). This was working when the code was first added, but recently broke in the reworking of the maildir-synchronization interface since the tags_to_maildir_flags function can no longer assume that it is being called as part of _notmuch_message_sync. Fortunately, the fix is as simple as adding an explicit call to _notmuch_message_sync.	2010-11-11 03:40:19 -08:00
Carl Worth	4b6063397f	Fix notmuch_message_maildir_flags_to_tags to iterate over filenames As documented, this function now iterates over all filenames for the message, computing a logical OR of the flags set on the filenames, then uses the final result to set tags on the message. This change fixes 3 of the 10 maildir-sync tests that have been failing since being added.	2010-11-11 03:40:19 -08:00
Carl Worth	1d02dd64af	lib: Add new, public notmuch_message_get_filenames This augments the existing notmuch_message_get_filename by allowing the caller access to all filenames in the case of multiple files for a single message. To support this, we split the iterator (notmuch_filenames_t) away from the list storage (notmuch_filename_list_t) where previously these were a single object (notmuch_filenames_t). Then, whenever the user asks for a file or filename, the message object lazily creates a complete notmuch_filename_list_t and then: For notmuch_message_get_filename, returns the first filename in the list. For notmuch_message_get_filenames, creates and returns a new iterator for the filename list.	2010-11-11 03:40:19 -08:00
Carl Worth	d422dcf0a2	lib: Remove the notion of TAGS_INVALID This rather ugly hack was recently obviated by the removal of the notmuch_database_set_maildir_sync function. Now, clients must make explicit calls to do any synchronization between maildir flags and tags. So the library no longer needs to worry about doing inconsistent synchronization while a message is only partially added.	2010-11-11 03:40:19 -08:00
Carl Worth	bb74e9dff8	lib: Rework interface for maildir_flags synchronization Instead of having an API for setting a library-wide flag for synchronization (notmuch_database_set_maildir_sync) we instead implement maildir synchronization with two new library functions: notmuch_message_maildir_flags_to_tags and notmuch_message_tags_to_maildir_flags These functions are nicely documented here, (though the implementation does not quite match the documentation yet---as plainly evidenced by the current results of the test suite).	2010-11-11 03:40:19 -08:00
Carl Worth	2c262042ac	lib: Remove the synchronization of 'T' flag with "deleted" tag. Tags in a notmuch database affect all messages with the identical message-ID. But maildir flags affect individual files. And since multiple files can contain the identical message-ID, there is not a one-to-one correspondence between messages affected by tags and flags. This is particularly dangerous with the 'T' (== "trashed") maildir flag and the corresponding "deleted" tag in the notmuch database. Since these flags/tags are often used to trigger irreversible deletion operations, the lack of one-to-one correspondence can be potentially dangerous. For example, consider the following sequence: 1. A third-party application is used to identify duplicate messages in the mail store, and mark all-but-one of each duplicate with the 'T' flag for subsequent deletion. 2. A "notmuch new" operation reads that 'T' flag, adding the "deleted" tag to the corresponding messages within the notmuch database. 3. A subsequent notmuch operation, (such as a "notmuch dump; notmuch restore" cycle) synchronizes the "deleted" tag back to the mail store, applying the 'T' flag to all(!) filenames with duplicate message IDs. 4. A third-party application reads the 'T' flags and irreversibly deletes all mail messages which had any duplicates(!). In order to avoid this scenario, we simply refuse to synchronize the 'T' flag with the "deleted" tag. Instead, applications can set 'T' and act on it to delete files, or can set "deleted" and act on it to delete files. But in either case the semantics are clear and there is never dangerous propagation through the one-to-many mapping of notmuch message objects to files.	2010-11-11 02:35:03 -08:00
Michal Sojka	d9d3d3e6f0	Make maildir synchronization configurable This adds group [maildir] and key 'synchronize_flags' to the configuration file. Its value enables (true) or disables (false) the synchronization between notmuch tags and maildir flags. By default, the synchronization is disabled.	2010-11-10 13:09:32 -08:00
Michal Sojka	088801a14a	Maildir synchronization This patch allows bi-directional synchronization between maildir flags and certain tags. The flag-to-tag mapping is defined by flag2tag array. The synchronization works this way: 1) Whenever notmuch new is executed, the following happens: o New messages are tagged with configured new_tags. o For new or renamed messages with maildir info present in the file name, the tags defined in flag2tag are either added or removed depending on the flags from the file name. 2) Whenever notmuch tag (or notmuch restore) is executed, a new set of flags based on the tags is constructed for every message and a new file name is prepared based on the old file name but with the new flags. If the flags differ and the old message was in the 'new' directory then this is replaced with 'cur' in the new file name. If the new and old file names differ, the file is renamed and notmuch database is updated accordingly. The rename happens before the database is updated. In case of crash between rename and database update, the next run of notmuch new brings the database in sync with the mail store again.	2010-11-10 13:09:31 -08:00
Carl Worth	d064bd696c	lib: Eliminate some redundant includes of xapian.h Most files including this already include database-private.h which includes xapian.h already.	2010-11-01 23:24:40 -07:00
Carl Worth	98845fdbb2	Avoid database corruption by not adding partially-constructed mail documents. Previously we were using Xapian's add_document to allocate document ID values for notmuch_message_t objects. This had the drawback of adding a partially constructed mail document to the database. If notmuch was subsequently interrupted before fully populating this document, then later runs would be quite confused when seeing the partial documents. There are reports from the wild of people hitting internal errors of the form "Message ... has no thread ID" for example, (which is currently an unrecoverable error). We fix this by manually allocating document IDs without adding documents. With this change, we never call Xapian's add_document method, but only replace_document with either the current document ID of a message or a new one that we have allocated.	2010-06-04 10:16:53 -07:00
Carl Worth	361b9d4bd9	Fix misnamed function in internal documentation. The documentation for several functions mentioned _notmuch_message_set_sync which doesn't exist. Fix these to reference _notmuch_message_sync instead.	2010-06-04 09:54:46 -07:00
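
The distinction drawn in 404db1de90 above (a filename with no ":2," info at all versus one whose info field is present but empty) can be illustrated with a minimal sketch. This is not the notmuch implementation: the type `maildir_info_t` and the functions `maildir_info_lookup` and `maildir_unread_from_filename` are hypothetical names introduced only for illustration, and the sketch assumes the standard maildir 'S' (seen) flag is what maps to the "unread" tag.

```c
#include <stdbool.h>
#include <string.h>

/* Hypothetical result type; not part of the notmuch API. */
typedef enum {
    MAILDIR_INFO_MISSING, /* no ":2," in the name: tags stay unchanged */
    MAILDIR_INFO_PRESENT  /* ":2," found: flags (possibly none) apply */
} maildir_info_t;

/* Locate the flags portion of a maildir filename.
 * On MAILDIR_INFO_PRESENT, *flags points just past ":2," and may be
 * the empty string. On MAILDIR_INFO_MISSING, *flags is set to NULL. */
static maildir_info_t
maildir_info_lookup (const char *filename, const char **flags)
{
    const char *info = strstr (filename, ":2,");

    if (info == NULL) {
        *flags = NULL;
        return MAILDIR_INFO_MISSING;
    }
    *flags = info + 3; /* skip over ":2," */
    return MAILDIR_INFO_PRESENT;
}

/* Decide whether the filename says anything about "unread".
 * Returns false when there is no info field, leaving *unread untouched.
 * Otherwise returns true and sets *unread to whether 'S' is absent. */
static bool
maildir_unread_from_filename (const char *filename, bool *unread)
{
    const char *flags;

    if (maildir_info_lookup (filename, &flags) == MAILDIR_INFO_MISSING)
        return false;

    *unread = (strchr (flags, 'S') == NULL);
    return true;
}
```

With this shape, a caller only touches tags when the function returns true, so a name without ":2," changes nothing, while a name ending in ":2," marks the message unread.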

184 commits