notmuch

mirror of https://git.notmuchmail.org/git/notmuch synced 2024-12-22 01:14:53 +01:00

Author	SHA1	Message	Date
Jakub Wilk	1b878877d9	doc: fix typos	2023-04-13 11:14:58 -03:00
David Bremner	2e5ef69fbf	lib: add field processor for lastmod: prefix By sharing the existing logic used by the sexp query parser, this allows negative lastmod revisions to be interpreted as relative to the most recent revision.	2022-09-03 08:43:33 -03:00
David Bremner	606d9b02e4	lib/sexp: provide relative lastmod queries Test the relatively trivial logic changes for the sexp query parser first before refactoring that logic to share with the infix query parser.	2022-09-03 08:36:53 -03:00
David Bremner	d73ddec5b8	doc/sexp: document range queries. Give examples for date fields, as these are commonly useful for user queries.	2022-06-25 19:52:42 -03:00
David Bremner	37492858b6	doc/sexp-queries: be more consistent about ending example text Most, but not all of the the explanatory texts already end in '.'. This makes the remainder match.	2022-04-30 14:22:51 -03:00
David Bremner	5e90c847d7	doc/sexp-queries: fix example Update the explanatory text to match the query.	2022-04-30 14:13:03 -03:00
David Bremner	b2eb0547e1	doc/sexp-queries: escape @ in non-verbatim text This prevents sphinx-doc from creating spurious mailto: links. Thanks to Jakub Wilk for telling me about the fix.	2022-04-30 13:46:51 -03:00
David Bremner	a1d139de4d	lib: add sexp: prefix to Xapian (infix) query parser. This is analogous to the "infix" prefix provided by the s-expression based query parser.	2022-04-15 08:25:46 -03:00
David Bremner	8ed6a172b3	lib: do not phrase parse prefixed bracketed subexpressions Since Xapian does not preserve quotes when passing the subquery to a field processor, we have to make a guess as to what the user intended. Here the added assumption is that a string surrounded by parens is not intended to be a phrase.	2022-03-19 07:27:29 -03:00
David Bremner	5c183d814b	doc: replace :math: for subscripts in notmuch-sexp-queries(7) Sphinx pulls in mathjax to do :math:, which complicates viewing the html version offline. The direct :sub: is supported directly in html.	2022-01-29 16:37:59 -04:00
David Bremner	b264a49be3	rename built_with.sexpr_query to built_with.sexp_queries It is confusing to use two different names (sexp vs sexpr) when compared with the command line option --query=sexp and (furthermore) singular vs plural when compared with the man page title.	2021-12-03 20:06:06 -04:00
David Bremner	93104f0d9d	doc/sexp-queries: replace definition lists with block quotes. This document contains meaningful markup in the terms, which makeinfo complains about. Replace the use of definition lists with regular paragraphs containing quote blocks. This is accomplished by splitting the "term" from the definition with a blank line.	2021-10-25 08:25:35 -03:00
David Bremner	00fdf10937	doc: remove explicit formatting of terms in definition lists Sphinx-doc already formats the terms appropriately for a given backend (bold in html and man). `makeinfo` complains noisily about formatting inside a @item if we add our own explicit formatting. This change may change the formatting in the info output. On the other hand, the existing use of quotes for bold is not that great anyway. In some places blank lines were removed to preserve the logical structure of a definition list.	2021-10-25 08:25:22 -03:00
David Bremner	49aa44bb01	doc/sexp-queries: update synopsis and description I chose to go with a somewhat terse synopsis to try to keep the length of the page down.	2021-09-04 17:07:19 -07:00
David Bremner	551254eb76	lib/parse-sexp: apply macros Macros implement lazy evaluation and lexical scope. The former is needed to make certain natural constructs work sensibly (e.g. (tag ,param)) but the latter is mainly future-proofing in case the DSL is is extended to allow local bindings. For technical background, see chapters 6 and 17 of [1] (or some other intermediate programming languages textbook). [1] http://cs.brown.edu/courses/cs173/2012/book/	2021-09-04 17:07:19 -07:00
David Bremner	6ab2d9b1a2	lib/parse-sexp: handle saved queries This provides functionality analogous to query: in the Xapian QueryParser based parser. Perhaps counterintuitively, the saved queries currently have to be in the original query syntax (i.e. not s-expressions).	2021-09-04 17:07:19 -07:00
David Bremner	a07ef8abf5	lib/parse-sexp: parse user headers One subtle aspect is the replacement of _find_prefix with _notmuch_database_prefix, which understands user headers. Otherwise the code mainly consists of creating a fake prefix record (since the user prefixes are not in the prefix table) and error handling.	2021-09-04 17:07:19 -07:00
David Bremner	cc5992a304	lib/parse-sexp: support infix subqueries This is necessary so that programs can take infix syntax queries from a user and use the sexp query syntax to construct e.g. a refinement of that query.	2021-09-04 17:07:19 -07:00
David Bremner	afe85e6578	lib/parse-sexp: expand queries The code here is just gluing together _notmuch_query_expand with the existing sexp parser infrastructure.	2021-09-04 17:07:19 -07:00
David Bremner	1870b3ae4b	lib/parse-sexp: support regular expressions At least to the degree that the Xapian QueryParser based parser also supports them. Support short alias 'rx' as it seems to make more complex queries nicer to read.	2021-09-04 17:07:19 -07:00
David Bremner	0ca4ad2670	lib/parse-sexp: add '' as syntactic sugar for '(starts-with "")' Users that insist on using a literal '' as a tag, can continue to do so by quoting it when searching.	2021-09-04 17:07:19 -07:00
David Bremner	011d06f4d6	lib/parse-sexp: 'starts-with' wildcard searches The many tests potentially overkill, but they could catch typos in the prefixes table. As a simplifying assumption, for now we assume a single argument to the wildcard operator, as this matches the Xapian semantics. The name 'starts-with' is chosen to emphasize the supported case of wildcards in currrent (1.4.x) Xapian.	2021-09-04 17:07:19 -07:00
David Bremner	8322f536f5	lib/parse-sexp: add term prefix backed fields We use "boolean" to describe fields that should generate terms literally without stemming or phrase splitting. This terminology might not be ideal but it is already enshrined in notmuch-search-terms(7).	2021-09-04 17:07:19 -07:00
David Bremner	90d9c2ad5c	lib/parse-sexp: support phrase queries. Anything that is quoted or not purely word characters is considered a phrase. Phrases are not stemmed, because the stems do not have positional information in the database. It is less efficient to scan the term twice, but it avoids a second pass to add prefixes, so maybe it balances out. In any case, it seems unlikely query parsing is very often a bottleneck.	2021-09-04 17:07:19 -07:00
David Bremner	200e164dc7	lib/parse-sexp: support subject field The broken tests are because we do not yet handle phrase searches.	2021-09-04 17:07:19 -07:00
David Bremner	f83cd2a05a	lib/parse-sexp: support and, not, and or. All operations and (Xapian) fields will eventually have an entry in the prefixes table. The flags field is just a placeholder for now, but will eventually distinguish between various kinds of prefixes.	2021-09-04 17:07:19 -07:00
David Bremner	a2785c3919	lib/parse-sexp: stem unquoted atoms This is somewhat less DWIM than the Xapian query parser, but it has the advantage of simplicity.	2021-09-04 17:07:19 -07:00
David Bremner	be7e83de96	lib/parse-sexp: parse single terms and the empty list. There is not much of a parser here yet, but it already does some useful error reporting. Most functionality sketched in the documentation is not implemented yet; detailed documentation will follow with the implementation.	2021-09-04 17:07:19 -07:00
Jani Nikula	ff4e81ac57	doc: cross-reference notmuch man pages with actual links Add internal hyperlink targets for man pages and cross-reference them using the any role reference. There are a number of alternatives to accomplish this, but this seems like the combination that retains the man page section number and the same boldface style in the man pages. As a bonus, we get sanity checking on the links; for example notmuch-search-terms.rst had a reference to notmuch-properties(1) i.e. the wrong section. The obvious semantic follow-up change would be to only have meaningful "see also" references instead of having them all everywhere.	2021-05-22 16:38:56 -03:00
Jani Nikula	3baa61e0e5	doc: use manpage role references to external man pages Using manpage role references generates helpful links in html documentation, while retaining the same boldface style in the man pages. The external man page site is configurable. The Debian manpage site seems like a good fit for Notmuch.	2021-05-22 09:56:52 -03:00
Tomi Ollila	507d2f07a6	doc: field processor support now always included, adjust manual pages The features that require field processor support, are now just documented w/o mentioning Xapian Field Processors' is needed for those. Replaced "compact" and "field_processor" with "retry_lock" in build_with config option, as it is currently the only one that is optionally excluded. The former 2 are now documented as features always included. Dropped one 'we' "passive" in notmuch-search-terms.rst. It was the only one, and inconsistent with rest of the documentation in that file. Dropped message about conditional open-ended ranges support, as those are now always supported.	2020-06-06 07:54:34 -03:00
Daniel Kahn Gillmor	4b1a8fd183	index: repair "Mixed Up" messages before indexing. When encountering a message that has been mangled in the "mixed up" way by an intermediate MTA, notmuch should instead repair it and index the repaired form. When it does this, it also associates the index.repaired=mixedup property with the message. If a problem is found with this repair process, or an improved repair process is proposed later, this should make it easy for people to reindex the relevant message. The property will also hopefully make it easier to diagnose this particular problem in the future. Signed-off-by: Daniel Kahn Gillmor <dkg@fifthhorseman.net>	2019-09-15 19:07:06 -04:00
Daniel Kahn Gillmor	9829533e92	index: avoid indexing legacy-display parts When we notice a legacy-display part during indexing, it makes more sense to avoid indexing it as part of the message body. Given that the protected subject will already be indexed, there is no need to index this part at all, so we skip over it. If this happens during indexing, we set a property on the message: index.repaired=skip-protected-headers-legacy-display Signed-off-by: Daniel Kahn Gillmor <dkg@fifthhorseman.net>	2019-09-01 08:45:30 -03:00
Daniel Kahn Gillmor	1b29822cf5	repair: set up codebase for repair functionality This adds no functionality directly, but is a useful starting point for adding new repair functionality. Signed-off-by: Daniel Kahn Gillmor <dkg@fifthhorseman.net>	2019-09-01 08:20:25 -03:00
David Bremner	9dedb23b47	doc: document user header indexing. It's a bit odd that the primary documentation is in notmuch-config, but it is consistent with the "query:" prefix.	2019-05-25 07:21:21 -03:00
David Bremner	319dd95ebb	lib: add 'body:' field, stop indexing headers twice. The new `body:` field (in Xapian terms) or prefix (in slightly sloppier notmuch) terms allows matching terms that occur only in the body. Unprefixed query terms should continue to match anywhere (header or body) in the message. This follows a suggestion of Olly Betts to use the facility (since Xapian 1.0.4) to add the same field with multiple prefixes. The double indexing of previous versions is thus replaced with a query time expension of unprefixed query terms to the various prefixed equivalent. Reindexing will be needed for 'body:' searches to work correctly; otherwise they will also match messages where the term occur in headers (demonstrated by the new tests in T530-upgrade.sh)	2019-04-17 08:48:16 -03:00
Daniel Kahn Gillmor	fd3c93650d	doc: clean up manpages Many of the manpages didn't treat literal text as literal text. I've tried to normalize some of the restructured text to make it a bit more regular. several of the synopsis lines are still untouched by this cleanup, but i'm not sure what the right way to represent those is in .rst, actually. In particular find that if i rebuild the manpages, sometimes i end up with some of the synopsis lines showing – (U+2013 EN DASH) where they should have -- (2 × U+002D HYPHEN-MINUS) in the generated nroff output, though i have not tracked down the source of this error yet.	2018-06-24 21:59:37 -03:00
David Bremner	f2e6f76a04	doc: document thread subqueries Mention both performance and quoting issues.	2018-05-07 08:42:53 -03:00
David Bremner	20ba0b7dfa	doc: add a section on quoting to notmuch-search-terms(7) I think we've diverged enough from the Xapian query parser that we can't rely on that syntax description [1]. As far as I can tell, [1] also only discusses quotes in the context of phrases. [1]: https://xapian.org/docs/queryparser.html	2018-04-24 23:08:10 -03:00
Matthew Lear	0cbe982bfd	Clarify the syntax required when searching using timestamps. Need to be clearer about specifying time ranges using timestamps. Legacy syntax which predates the date prefix is still supported, but timestamps used in conjunction with the date prefix require additional syntax.	2018-03-24 20:07:20 -03:00
Jani Nikula	e5e252de55	doc: unify definition list usage across man pages Make all parameter descriptions etc. use reStructuredText definition lists with uniform style and indentation. Remove redundant indentation from around the lists. Remove blank lines between term lines and definition blocks. Use four spaces for indentation. This is almost completely whitespace and paragraph reflow changes.	2017-12-31 09:06:11 -04:00
Jani Nikula	89f651a403	doc: arrange search prefix documentation in a definition list Having first a list of prefixes followed by detailed descriptions was viable when we didn't have all that many prefixes. Now, arranging the prefix descriptions in a definition list makes more sense. While at it, include all the supported prefix forms, especially some missing regex ones.	2017-12-14 21:41:39 -04:00
Daniel Kahn Gillmor	29648a137c	crypto: actually stash session keys when decrypt=true If you're going to store the cleartext index of an encrypted message, in most situations you might just as well store the session key. Doing this storage has efficiency and recoverability advantages. Combined with a schedule of regular OpenPGP subkey rotation and destruction, this can also offer security benefits, like "deletable e-mail", which is the store-and-forward analog to "forward secrecy". But wait, i hear you saying, i have a special need to store cleartext indexes but it's really bad for me to store session keys! Maybe (let's imagine) i get lots of e-mails with incriminating photos attached, and i want to be able to search for them by the text in the e-mail, but i don't want someone with access to the index to be actually able to see the photos themselves. Fret not, the next patch in this series will support your wacky uncommon use case.	2017-12-08 08:08:47 -04:00
Daniel Kahn Gillmor	f845fb2a51	cli/show, reply: document use of stashed session keys in notmuch-properties The stashed session keys are stored internally as notmuch properties. So a user or developer who is reading about those properties might want to understand how they fit into the bigger picture. Note here that decrypting with a stored session key no longer needs -decrypt for "notmuch show" and "notmuch reply".	2017-12-08 08:08:46 -04:00
Daniel Kahn Gillmor	d3964e81ac	indexing: Change from try_decrypt to decrypt the command-line interface for indexing (reindex, new, insert) used --try-decrypt; and the configuration records used index.try_decrypt. But by comparison with "show" and "reply", there doesn't seem to be any reason for the "try" prefix. This changeset adjusts the command-line interface and the configuration interface. For the moment, i've left indexopts_{set,get}_try_decrypt alone. The subsequent changeset will address those.	2017-12-08 08:05:53 -04:00
Daniel Kahn Gillmor	a990585408	crypto: use stashed session-key properties for decryption, if available When doing any decryption, if the notmuch database knows of any session keys associated with the message in question, try them before defaulting to using default symmetric crypto. This changeset does the primary work in _notmuch_crypto_decrypt, which grows some new parameters to handle it. The primary advantage this patch offers is a significant speedup when rendering large encrypted threads ("notmuch show") if session keys happen to be cached. Additionally, it permits message composition without access to asymmetric secret keys ("notmuch reply"); and it permits recovering a cleartext index when reindexing after a "notmuch restore" for those messages that already have a session key stored. Note that we may try multiple decryptions here (e.g. if there are multiple session keys in the database), but we will ignore and throw away all the GMime errors except for those that come from last decryption attempt. Since we don't necessarily know at the time of the decryption that this is the last decryption attempt, we'll ask for the errors each time anyway. This does nothing if no session keys are stashed in the database, which is fine. Actually stashing session keys in the database will come as a subsequent patch.	2017-12-04 21:48:31 -04:00
Daniel Kahn Gillmor	d0da7a0a1c	config: define new option index.try_decrypt By default, notmuch won't try to decrypt on indexing. With this patch, we make it possible to indicate a per-database preference using the config variable "index.try_decrypt", which by default will be false. At indexing time, the database needs some way to know its internal defaults for how to index encrypted parts. It shouldn't be contingent on an external config file (since that can't be retrieved from the database object itself), so we store it in the database. This behaves similarly to the query.* configurations, which are also stored in the database itself, so we're not introducing any new dependencies by requiring that it be stored in the database.	2017-10-21 19:54:33 -03:00
Daniel Kahn Gillmor	4dfcc8c9b2	crypto: index encrypted parts when indexopts try_decrypt is set. If we see index options that ask us to decrypt when indexing a message, and we encounter an encrypted part, we'll try to descend into it. If we can decrypt, we add the property index.decryption=success. If we can't decrypt (or recognize the encrypted type of mail), we add the property index.decryption=failure. Note that a single message may have both values of the "index.decryption" property: "success" and "failure". For example, consider a message that includes multiple layers of encryption. If we manage to decrypt the outer layer ("index.decryption=success"), but fail on the inner layer ("index.decryption=failure"). Because of the property name, this will be automatically cleared (and possibly re-set) during re-indexing. This means it will subsequently correspond to the actual semantics of the stored index.	2017-10-21 19:53:19 -03:00
Daniel Kahn Gillmor	0bb05ff693	reindex: drop all properties named with prefix "index." This allows us to create new properties that will be automatically set during indexing, and cleared during re-indexing, just by choice of property name.	2017-10-21 19:53:08 -03:00
Daniel Kahn Gillmor	6575b7eb31	doc: add notmuch-properties(7) We will want a user-facing place to record details about the use of notmuch properties shortly. This establishes a new manual page for that purpose.	2017-10-21 19:52:55 -03:00

1 2

69 commits