[go: up one dir, main page]

Menu

Code Commit Log


Commit Date  
[r1158] by mukau

+ tagged ddc-2.1.22

2018-12-12 11:23:46 Tree
[r1157] by mukau

v2.1.22: bug-fixes for new prefix- and suffix-search introduced in v2.1.21 (mantis #33072 [bams|zeit])
* speed improvements too
- prefix searches are 7-26% faster than ddc-v2.1.18
- suffix search performance gains from v2.1.21 remain; especially for highly specific suffixes:
+ *haus : ca. 32x faster (S=32.07)
+ *erde : ca. 74x faster (S=74.49)

2018-12-12 10:31:32 Tree
[r1156] by mukau

+ tagged ddc-2.1.21

2018-12-10 15:14:31 Tree
[r1155] by mukau

+ ddc-2.1.21 release

2018-12-10 15:14:18 Tree
[r1154] by mukau

+ tagged ddc-2.1.21-rc1

2018-12-10 15:03:01 Tree
[r1153] by mukau

v2.1.21-rc1:
* added suffix-query optimization via new index files CORPUS._suffix_TOKATTR
- new low-level evaluation method CStringIndexSet::QueryTokenListWithRightTruncation()
- new suffix-sorted index CIndexSetForQueryingStage::m_rIndex
- can result in drastically improved suffix query-times compared to old regex-based vocabulary scan:
+ *en : 24% faster (Amdahl S=1.24)
+ *chen : 169% faster (Amdahl S=3.69)
+ *ber : 432% faster (Amdahl S=5.32)
* added ddc_(index|union|split) code to create CORPUS._suffix_TOKATTR for all indices at build-time
- very small memory and disk footprint (<1% total index size, growth O(NTypes))
* suffix-indices are still optional: if not present, old regex-based vocabulary scan will be used

2018-12-10 15:00:54 Tree
[r1152] by mukau

+ suffix-index support for ddc_(index|union|split)

2018-12-10 13:35:53 Tree
[r1151] by mukau

+ suffix index: documented and cleaned up testing code
+ TODO: ddc_(index|union|split) integration

2018-12-10 12:57:41 Tree
[r1150] by mukau

+ suffix-index tests

2018-12-10 12:27:57 Tree
[r1149] by mukau

+ first stab at auxiliary suffix-index
- no ddc_index-support yet, test build with perl script scripts/ddc-make-suffix-index.perl
+ can be up to 7x faster for small-ish sets of matched types (e.g. *ber)
+ only ca. 20% improvement for large match-sets (e.g. *en)
- todo: optimize in-place suffix-comparison struct, avoid temporaries

2018-12-07 14:03:03 Tree
Older >