[go: up one dir, main page]

Menu

/ddc/trunk Commit Log


Commit Date  
[r1126] by mukau

* binary search for CQuerySequenceNode -- not much joy (max ca. 3% faster)

2018-11-29 08:36:23 Tree
[r1125] by mukau

* added macro DDC_GETBREAK_LB_MIN to QueryNode.cpp
- threshold for use of std::lower_bound() in break-identification step of CQueryNode::ConvertOccurrencesToHits()
- default value = 16k -> ca. 10% faster queries

2018-11-28 11:47:15 Tree
[r1124] by mukau

v2.1.19-pre1 * added macro DDC_HIT_INTERSECTION_BINSEARCH to QueryNode.cpp
- use std::lower_bound for evaluating hit-list intersection and difference
- may be 3-5% faster for large hit-lists (function words)
* attempted profiling of CQueryNode::ConvertOccurrencesToHits() [perf, callgrind]: no insights

2018-11-27 15:08:07 Tree
[r1122] by mukau

v2.1.18 release

2018-11-02 12:16:02 Tree
[r1119] by mukau

+ force LC_NUMERIC=C in ConcordOptions.cpp (otherwise JSON responses may not be parseable)

2018-10-25 08:27:41 Tree
[r1118] by mukau

+ v2.1.18-rc2: force LC_NUMERIC=C (in progress)

2018-10-25 08:24:36 Tree
[r1113] by mukau

* fixed ddc_split in presence of legacy anonymized tokens (those beginning with -#)
- workaround (hack): all types with 0 occurrences in source corpus are assumed to by anonymized,
and are added to the output vocabulary of *every* partition -- this is a bit bloated but should work
* added ddc_index option --anonymize (-a): enable legacy anonymized tokens (disabled by default)
- fixes mantis #31586

2018-10-23 13:32:29 Tree
[r1112] by mukau

* trim *._con_prefix

2018-10-23 10:34:03 Tree
[r1109] by mukau

* added AC_DEFINE()s for __STDC_LIMIT_MACROS, __STDC_CONSTANT_MACROS
- workaround for old C++ compilers (e.g. debian wheezy)
- fixes compile error for ddcTime.cpp: 'SIZE_MAX' was not declared in this scope

2018-10-19 07:25:31 Tree
[r1108] by mukau

- mantis #31439: added missing Trim() for m_CommonFilePrefix in CConcIndexator::LoadCorpusFiles()

2018-10-15 07:17:11 Tree
Older >