[go: up one dir, main page]

Menu

/ddc Commit Log


Commit Date  
[r1154] by mukau

+ tagged ddc-2.1.21-rc1

2018-12-10 15:03:01 Tree
[r1153] by mukau

v2.1.21-rc1:
* added suffix-query optimization via new index files CORPUS._suffix_TOKATTR
- new low-level evaluation method CStringIndexSet::QueryTokenListWithRightTruncation()
- new suffix-sorted index CIndexSetForQueryingStage::m_rIndex
- can result in drastically improved suffix query-times compared to old regex-based vocabulary scan:
+ *en : 24% faster (Amdahl S=1.24)
+ *chen : 169% faster (Amdahl S=3.69)
+ *ber : 432% faster (Amdahl S=5.32)
* added ddc_(index|union|split) code to create CORPUS._suffix_TOKATTR for all indices at build-time
- very small memory and disk footprint (<1% total index size, growth O(NTypes))
* suffix-indices are still optional: if not present, old regex-based vocabulary scan will be used

2018-12-10 15:00:54 Tree
[r1152] by mukau

+ suffix-index support for ddc_(index|union|split)

2018-12-10 13:35:53 Tree
[r1151] by mukau

+ suffix index: documented and cleaned up testing code
+ TODO: ddc_(index|union|split) integration

2018-12-10 12:57:41 Tree
[r1150] by mukau

+ suffix-index tests

2018-12-10 12:27:57 Tree
[r1149] by mukau

+ first stab at auxiliary suffix-index
- no ddc_index-support yet, test build with perl script scripts/ddc-make-suffix-index.perl
+ can be up to 7x faster for small-ish sets of matched types (e.g. *ber)
+ only ca. 20% improvement for large match-sets (e.g. *en)
- todo: optimize in-place suffix-comparison struct, avoid temporaries

2018-12-07 14:03:03 Tree
[r1148] by mukau

+ tagged ddc-2.1.20

2018-12-06 09:48:03 Tree
[r1147] by mukau

+ v2.1.20: bugfix for WITH

2018-12-06 09:47:17 Tree
[r1146] by mukau

+ removed tag ddc-2.1.20

2018-12-06 09:47:03 Tree
[r1145] by mukau

+ tagged ddc-2.1.20

2018-12-06 09:43:22 Tree
Older >