bcb4e51a409d94ae670de96afb8483a4f7855294 |
|
01-Jan-2018 |
Stephan Bosch <stephan.bosch@dovecot.fi> |
Updated copyright notices to include the year 2018. |
993b1b96f38df4970fb2c41d47b660a8c64bda98 |
|
14-Dec-2017 |
Timo Sirainen <timo.sirainen@dovecot.fi> |
fts: Code cleanup |
3813ebcae2fe8b75a610fe20b20a25a5919043f2 |
|
14-Dec-2017 |
Timo Sirainen <timo.sirainen@dovecot.fi> |
fts: Don't try to retry parsing if indexing already fails |
d7c139d9a260abc012ce97ab4945bb64b4af2c10 |
|
14-Dec-2017 |
Timo Sirainen <timo.sirainen@dovecot.fi> |
fts: Use consistent naming for retriable_err_msg
It's only for retriable errors, not errors in general |
d8ebe885f013b600e4ebe632bb07e295fbdd628b |
|
14-Dec-2017 |
Timo Sirainen <timo.sirainen@dovecot.fi> |
fts: Make sure may_need_retry is reset between attempts.
It wasn't if the second attempt failed early. |
8e1eaf849ce4668c0941338e3e3671db159da575 |
|
11-Dec-2017 |
Sergey Kitov <sergey.kitov@open-xchange.com> |
fts: Retry indexing for tika backend in case of internal server error |
72d301f5751986396533088b769a9e74735a467c |
|
11-Dec-2017 |
Sergey Kitov <sergey.kitov@open-xchange.com> |
fts: Extend parser's deinit function interface with retriable_err_msg_r parameter
the new parameter will be set to a error string in case indexing may
need retrying, so it will be logged later when decision about retry will be made. |
43f4e550efe7afa72aab3a7ded7ad7ce606526fb |
|
11-Dec-2017 |
Sergey Kitov <sergey.kitov@open-xchange.com> |
fts: Parser move try_init arguments to a single structure parser_context. |
6307d76096764e66bddc63d4a3e5a1aa19cc528f |
|
19-Sep-2017 |
Josef 'Jeff' Sipek <jeff.sipek@dovecot.fi> |
global: start relying on buffer_free(NULL) being a no-op
Cleanup performed with the following semantic patch:
@@
expression E;
@@
- if (E != NULL) {
- buffer_free(&E);
- }
+ buffer_free(&E); |
bf7dc750b95039981c0e9d728f313d50cf38a156 |
|
23-Mar-2017 |
Martti Rannanjärvi <martti.rannanjarvi@dovecot.fi> |
global: Log internal storage error on failure |
2454dfa32c93c20a8522c6ed42fe057baaac9f9a |
|
01-Jan-2017 |
Stephan Bosch <stephan.bosch@dovecot.fi> |
Updated copyright notices to include the year 2017. |
efe78d3ba24fc866af1c79b9223dc0809ba26cad |
|
24-Dec-2016 |
Stephan Bosch <stephan.bosch@dovecot.fi> |
global: Replaced all instances of memset(p, 0, sizeof(*p)) with the new i_zero() macro.
Used the following script:
C_FILES=`git ls-files *.c`
H_FILES=`git ls-files *.h`
for F in "$C_FILES $H_FILES"; do
echo "$F"
perl -p -i -e 's/safe_memset\(&\(?([^,]*)\)?,\s*0,\s*sizeof\(\g1\)\)/i_zero_safe(&$1)/g' $F
perl -p -i -e 's/safe_memset\(([^,]*),\s*0,\s*sizeof\(\*\g1\)\)/i_zero_safe($1)/g' $F
perl -p -i -e 's/memset\(&\(?([^,]*)\)?,\s*0,\s*sizeof\(\g1\)\)/i_zero(&$1)/g' $F
perl -p -i -e 's/memset\(([^,]*),\s*0,\s*sizeof\(\*\g1\)\)/i_zero($1)/g' $F
done |
13961419ca9710eb80c254e00510c58c466f3c08 |
|
18-Oct-2016 |
Timo Sirainen <timo.sirainen@dovecot.fi> |
global: Handle broken Content-Type headers consistently.
1) Only the first Content-Type header is used. (Simpler than using the last.)
2) Invalid Content-Types are parsed as far as we can. This is mainly to make
sure they aren't treated as missing Content-Types, which could cause them to
become text/plain. |
02b78558dc03daa2e7da2010b63f247b49936a38 |
|
03-Aug-2016 |
Timo Sirainen <timo.sirainen@dovecot.fi> |
Use mail_get_*stream_because() wherever possible. |
489301ee88b2174e3171875e979e667de2c4a174 |
|
28-Feb-2016 |
Timo Sirainen <timo.sirainen@dovecot.fi> |
lib-storage: When cached mime.parts is detected to be inconsistent, log it as hex-encoded. |
8b5b1f6cb19253dfd7821fcef8e9b7e95e6caf3a |
|
28-Feb-2016 |
Timo Sirainen <timo.sirainen@dovecot.fi> |
Use mail_set_cache_corrupted_reason() wherever possible. |
4ac43ad128cb9b22f1c936659a1ff62e5b6c8bc3 |
|
16-Feb-2016 |
Timo Sirainen <timo.sirainen@dovecot.fi> |
fts: Code cleanup - moved fts_header_has_language() to a global function |
02c335c23bf5fa225a467c19f2c063fb0dc7b8c3 |
|
12-Jan-2016 |
Timo Sirainen <timo.sirainen@dovecot.fi> |
global: freshen copyright
git ls-files | xargs perl -p -i -e 's/(\d+)-201[0-5]/$1-2016/g;s/ (201[0-5]) Dovecot/ $1-2016 Dovecot/' |
b68472c305d841dc703a37145ebad9e85f1c02ab |
|
08-Dec-2015 |
Timo Sirainen <tss@iki.fi> |
fts: When indexing header names with lib-fts, add them using "data" language. |
dbf26a3ea43cd79fe88f01ec99c7d9440679b996 |
|
03-Dec-2015 |
Timo Sirainen <tss@iki.fi> |
fts: Added support for per-language tokenizer settings.
fts_tokenizer_<lang> now overrides fts_tokenizers setting.
fts_tokenizer_<name>_<lang> now overrides fts_tokenizer_<name> setting. |
07fcb647e58d930cc601fdecc94ff2da9ec44882 |
|
03-Jun-2015 |
Timo Sirainen <tss@iki.fi> |
fts: If reading mail fails, log an error. |
c2c6342dbf982adddf24058366582ce6eab2ed7c |
|
02-Jun-2015 |
Timo Sirainen <tss@iki.fi> |
fts: If we fail while indexing headers, return failure immediately. |
3d00e356f156bb5d105de228fac7ed8f28851638 |
|
02-Jun-2015 |
Timo Sirainen <tss@iki.fi> |
fts: Error logging fix.
1) We were logging the error after it was already freed from data stack.
2) We were logging uninitialized error string when fts indexing was the one
that failed. |
0983b2432833a98dd35674b7106069a601b846e3 |
|
16-May-2015 |
Timo Sirainen <tss@iki.fi> |
fts: Avoid excessive data stack usage with lib-fts |
9bee6caa05bcdb38a631d56708792ff79768a0bb |
|
09-May-2015 |
Timo Sirainen <tss@iki.fi> |
fts: Fixed crash when not using lib-fts |
8b1a9a4d63b0abccdf7cb1acb8359d5396dd657b |
|
09-May-2015 |
Timo Sirainen <tss@iki.fi> |
lib-fts: Changed fts_tokenizer_next/final() to return error string.
The current tokenizers can't fail, but if we're doing tokenization via
external services they could fail. |
60ba197d17023594231d9805d889817782e41859 |
|
09-May-2015 |
Timo Sirainen <tss@iki.fi> |
fts: Lowecase non-human language input while indexing. |
5ea079cebbcf12cde7070a9174b33e1bbab72465 |
|
09-May-2015 |
Timo Sirainen <tss@iki.fi> |
fts: Reset tokenizers before using them
This is mainly needed if the previous tokenization had failed. |
1d0f568e26ce5cbf18cd7bb335c6eea20a7e3770 |
|
09-May-2015 |
Teemu Huovila <teemu.huovila@dovecot.fi> |
fts: Improve filter error handling.
In lib-fts:
Move error storage to the generic filter struct level. Change make_utf8
helper to void also.
In fts:
Add the error argument to _filter() calls. |
db090e2a48bcc5ce162af0c36eef04407421379d |
|
09-May-2015 |
Teemu Huovila <teemu.huovila@dovecot.fi> |
fts: Change filter API to be able to return errors
Modify fts_filter_filter() to return integer status codes. It returns
1 if a token was returned, 0 if it was filtered away and -1 on error. |
3dc5a231160859c9627157dc53a94d5e4494fe9f |
|
09-May-2015 |
Teemu Huovila <teemu.huovila@dovecot.fi> |
fts: Change tokenizer API to be able to return errors
Modify fts_tokenizer_next() to return integer status codes. It returns
1 if a token was returned in *token_r, 0 if more input is needed and -1
on error. |
4ef1f9f3293965734e6e3c38c191ceb2246a721f |
|
09-May-2015 |
Teemu Huovila <teemu.huovila@dovecot.fi> |
fts: Create tokenizers differently
Create tokenizers earlier. Create separate tokenizers for search
and indexing. Enable configuration of tokenizers. Add some helpers
in fts-tokenizer.h api. Change tokenizer unit tests to match
those changes.
lib-fts: Refactor lib-fts settings a bit
Turned address tokenizer settings into "boolean" values. Changed
have_parent to "no_parent" and added "search" setting. Added
documentation in fts-tokenizer.h. Change unit tests accordingly. |
09aed882b99e865ff6d7140ae2f77a42c9e7d1a7 |
|
09-May-2015 |
Teemu Huovila <teemu.huovila@dovecot.fi> |
fts: Create filters earlier
This builds on the assumption that early initialization will be made
less costly, in a coming change. |
a32b055a1a537c2e2eade309bae70b1344bb3b37 |
|
24-Apr-2015 |
Timo Sirainen <tss@iki.fi> |
fts: Minor error messagage prefix change. |
76cb5c2b65f8ca773c253e2c3b027517c388812b |
|
21-Apr-2015 |
Timo Sirainen <tss@iki.fi> |
fts: If backend has FTS_BACKEND_FLAG_TOKENIZED_INPUT set, index also the header name.
We were sending the hdr_name to the backend without tokenizing it or
filtering it, so the backend couldn't have done anything useful about it. |
1537d20b852cbbf0d6971790b84e0cce5ca61307 |
|
20-Apr-2015 |
Timo Sirainen <tss@iki.fi> |
fts: Added FTS_BACKEND_FLAG_TOKENIZED_INPUT, which is implemented via lib-fts. |
fed97f5beeadfe51190d772c5b3a6635c741f2c8 |
|
20-Apr-2015 |
Timo Sirainen <tss@iki.fi> |
fts: Code cleanup to do all index updates via fts_build_data()
This doesn't actually change the behavior. |
70fc76bbc5b4e57168c2b75a97d26edc1886ef36 |
|
18-Mar-2015 |
Timo Sirainen <tss@iki.fi> |
fts: Fixed assert-crash if fts-parser was created for a body part that backend didn't want to index. |
5518182f1165884742f9eb37eb2e6136b29394e8 |
|
13-Mar-2015 |
Timo Sirainen <tss@iki.fi> |
fts: If fts-parser fails, stop indexing instead of ignoring the error and continuing.
This is especially important in case there's just some temporary error. |
78199a8ade26f7694010f1096c6fdf055a5553bd |
|
11-Mar-2015 |
Timo Sirainen <tss@iki.fi> |
fts: Added missing error logging/setting. |
a5f7c7c3ef03fbf956df028a49f2fa6830428509 |
|
27-Jan-2015 |
Timo Sirainen <tss@iki.fi> |
fts: When indexing address headers, don't add MISSING_MAILBOX or MISSING_DOMAIN. |
7cb128dc4cae2a03a742f63ba7afee23c78e3af0 |
|
05-Jan-2015 |
Phil Carmody <phil@dovecot.fi> |
global: freshen copyright
Robomatically:
git ls-files | xargs perl -p -i -e 's/(\d+)-201[0-4]/$1-2015/g;s/ (201[0-4]) Dovecot/ $1-2015 Dovecot/'
Happy 2015 everyone!
Signed-off-by: Phil Carmody <phil@dovecot.fi> |
69111aa5f9df3c49aeefbe0ac6118cf80c7520f1 |
|
17-Apr-2014 |
Timo Sirainen <tss@iki.fi> |
fts: Added message_part to fts_backend_build_key.
This allows FTS backends to index MIME parts separately from each others. |
f8e6a1246e8ec66bf75c85edfe0797aa68808d8e |
|
17-Apr-2014 |
Timo Sirainen <tss@iki.fi> |
fts: Free fts-parser on failures. |
a8c5a86d183db25a57bf193c06b41e092ec2e151 |
|
04-Feb-2014 |
Timo Sirainen <tss@iki.fi> |
Updated copyright notices to include year 2014. |
8ae72ad7d0c69e972cfa65d1e2ce4e3e9a8b765c |
|
24-Feb-2013 |
Timo Sirainen <tss@iki.fi> |
Replaced all -1U and (unsigned int)-1 with UINT_MAX.
It's somewhat clearer this way. Also clang's -fsanitize=integer gives
runtime errors about -1U (but not about explicit casts, so no need to change
(type)-1 casts). |
e6231264bedab49be190f2e89fd9cb6ca59ae62d |
|
22-Feb-2013 |
Timo Sirainen <tss@iki.fi> |
fts: Don't send binary MIME parts to backend through UTF8 text conversion.
Based on patch by Mike Abbott / Apple. |
5a580c3a38ced62d4bcc95b8ac7c4f2935b5d294 |
|
02-Feb-2013 |
Timo Sirainen <tss@iki.fi> |
Oops :) Update copyrights to 2013 without breaking all .c files. |
cca4ba2a504d70a9fe9fee37f8433997359de52c |
|
02-Feb-2013 |
Timo Sirainen <tss@iki.fi> |
Updated copyright notices to include year 2013. |
c51afc0ab251923fbfcad5059af27a7fefab3502 |
|
27-Nov-2012 |
Timo Sirainen <tss@iki.fi> |
Reversed recent "short utf8" changes.
Solr code needs to parse the UTF8 input explicitly anyway to encode the XML
characters. And all the character checks were already done in it. |
119cbc046f3c6c245d1f5ad4f5d14ea20d520bdc |
|
27-Nov-2012 |
Timo Sirainen <tss@iki.fi> |
fts: Added FTS_BACKEND_FLAG_BUILD_SHORT_UTF8 to require sending only short UTF8 data to backend. |
d9076f5939edf5d20a261494b1a861dcbb0d32e2 |
|
15-Sep-2012 |
Timo Sirainen <tss@iki.fi> |
Replaced "decomposed titlecase" conversions with more generic normalizer function.
Plugins can now change mail_user.default_normalizer. Specific searches can
also use different normalizers by changing mail_search_context.normalizer. |
4e8d6d03c2ff85448df79b181a2ea850fb5d4199 |
|
27-Jun-2012 |
Timo Sirainen <tss@iki.fi> |
Try to avoid (void) casts by adding more ATTR_NOWARN_UNUSED_RESULT. |
5f5870385cff47efd2f58e7892f251cf13761528 |
|
12-Feb-2012 |
Timo Sirainen <tss@iki.fi> |
Updated copyright notices to include year 2012. |
84f447a23beceb6ab5898083052e1b5bae51f033 |
|
05-Nov-2011 |
Timo Sirainen <tss@iki.fi> |
fts: Added FTS_BACKEND_FLAG_BUILD_FULL_WORDS for sending data to backends only in full words. |
421d30619384e72a27e2a5d13ff6525aff4d17fe |
|
31-Aug-2011 |
Timo Sirainen <tss@iki.fi> |
Redesigned mail precaching APIs.
There's no longer a one monolithic mailbox_sync(MAILBOX_SYNC_FLAG_PRECACHE)
call, but rather one mail_precache() call for each mail to be precached.
This allows the callers to show the progress and in general is cleaner. |