cunei-commits Mailing List for Cunei Machine Translation Platform (Page 2)
Status: Beta
Brought to you by:
aaronbphillips
| Year | Jan | Feb | Mar | Apr | May | Jun | Jul | Aug | Sep | Oct | Nov | Dec |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2009 | | | | | | 96 | 161 | 12 | 18 | 17 | 6 | 8 |
| 2010 | 28 | 18 | 26 | 37 | 37 | 10 | 5 | 41 | 11 | 32 | 30 | 28 |
| 2011 | 29 | 16 | 23 | 40 | 15 | 14 | 24 | 7 | 16 | 4 | 12 | |
| 2012 | | 4 | | | | | | | | | | |
|
From: <aar...@us...> - 2011-09-21 03:14:19
|
Revision: 801
http://cunei.svn.sourceforge.net/cunei/?rev=801&view=rev
Author: aaronbphillips
Date: 2011-09-21 03:14:12 +0000 (Wed, 21 Sep 2011)
Log Message:
-----------
Increased the default value for the maximum size of the n-best list to retain during optimization
Modified Paths:
--------------
src/cunei/cli/Optimize.java
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <aar...@us...> - 2011-09-15 13:25:55
|
Revision: 800
http://cunei.svn.sourceforge.net/cunei/?rev=800&view=rev
Author: aaronbphillips
Date: 2011-09-15 13:25:46 +0000 (Thu, 15 Sep 2011)
Log Message:
-----------
Modified handling of the number of threads
Modified Paths:
--------------
data/data.xml
src/cunei/util/ThreadExecutor.java
|
|
From: <aar...@us...> - 2011-09-14 19:18:33
|
Revision: 799
http://cunei.svn.sourceforge.net/cunei/?rev=799&view=rev
Author: aaronbphillips
Date: 2011-09-14 19:18:27 +0000 (Wed, 14 Sep 2011)
Log Message:
-----------
Added final decoding run after optimization
Modified Paths:
--------------
data/systems/fr-en.default/build.xml
data/systems/kr-en.default/build.xml
data/systems/systems.xml
|
|
From: <aar...@us...> - 2011-09-14 18:51:24
|
Revision: 798
http://cunei.svn.sourceforge.net/cunei/?rev=798&view=rev
Author: aaronbphillips
Date: 2011-09-14 18:51:18 +0000 (Wed, 14 Sep 2011)
Log Message:
-----------
Added Europarl French-English tune and test sets
Modified Paths:
--------------
data/systems/cz-en.cluster.default/build.xml
Added Paths:
-----------
data/corpora/fr-en/
data/corpora/fr-en/raw/
data/corpora/fr-en/raw/europarl-v6-test-sample-test.sentences
data/corpora/fr-en/raw/europarl-v6-test-sample-tune.sentences
|
|
From: <aar...@us...> - 2011-09-13 12:44:19
|
Revision: 797
http://cunei.svn.sourceforge.net/cunei/?rev=797&view=rev
Author: aaronbphillips
Date: 2011-09-13 12:44:07 +0000 (Tue, 13 Sep 2011)
Log Message:
-----------
Added simple French-English system to build scripts
Modified Paths:
--------------
bin/cluster-mkcls.sh
data/data.xml
data/lmodels/en/build.xml
data/systems/cz-en.cluster.default/build.xml
data/systems/cz-en.default/build.xml
data/systems/de-en.default/build.xml
data/systems/kr-en.default/build.xml
data/systems/systems.xml
Added Paths:
-----------
data/lmodels/europarl.xml
data/lmodels/news-europarl-czeng.xml
data/lmodels/samsung.xml
data/systems/cz-en.cluster.small.default/
data/systems/cz-en.cluster.small.default/build.xml
data/systems/fr-en.default/
data/systems/fr-en.default/build.xml
Removed Paths:
-------------
data/lmodels/en/news-europarl-czeng.xml
data/lmodels/en/samsung.xml
|
|
From: <aar...@us...> - 2011-09-12 13:01:29
|
Revision: 796
http://cunei.svn.sourceforge.net/cunei/?rev=796&view=rev
Author: aaronbphillips
Date: 2011-09-12 13:01:23 +0000 (Mon, 12 Sep 2011)
Log Message:
-----------
Added preliminary build script for cz-en clusters
Modified Paths:
--------------
data/systems/cz-en.default/build.xml
data/systems/de-en.default/build.xml
data/systems/kr-en.default/build.xml
data/systems/systems.xml
Added Paths:
-----------
data/systems/cz-en.cluster.default/
data/systems/cz-en.cluster.default/build.xml
|
|
From: <aar...@us...> - 2011-09-12 13:01:19
|
Revision: 795
http://cunei.svn.sourceforge.net/cunei/?rev=795&view=rev
Author: aaronbphillips
Date: 2011-09-12 13:01:12 +0000 (Mon, 12 Sep 2011)
Log Message:
-----------
Added preliminary build script for cz-en clusters
Modified Paths:
--------------
bin/export-czeng.pl
bin/export-europarl.pl
Added Paths:
-----------
bin/cluster-mkcls.sh
|
|
From: <aar...@us...> - 2011-09-12 01:57:45
|
Revision: 794
http://cunei.svn.sourceforge.net/cunei/?rev=794&view=rev
Author: aaronbphillips
Date: 2011-09-12 01:57:39 +0000 (Mon, 12 Sep 2011)
Log Message:
-----------
Added sampled test sets for Europarl and CzEng
Added Paths:
-----------
data/corpora/cz-en/raw/
data/corpora/cz-en/raw/czeng-v0.9-dev-sample-test.sentences
data/corpora/cz-en/raw/czeng-v0.9-dev-sample-tune.sentences
data/corpora/de-en/raw/
data/corpora/de-en/raw/europarl-v6-test-sample-test.sentences
data/corpora/de-en/raw/europarl-v6-test-sample-tune.sentences
|
|
From: <aar...@us...> - 2011-09-12 01:51:19
|
Revision: 793
http://cunei.svn.sourceforge.net/cunei/?rev=793&view=rev
Author: aaronbphillips
Date: 2011-09-12 01:51:12 +0000 (Mon, 12 Sep 2011)
Log Message:
-----------
Updated build scripts to work properly with the latest revision
Modified Paths:
--------------
bin/cunei.sh
bin/eval.sh
bin/export-europarl.pl
build.xml
data/corpora/corpora.xml
data/corpora/cz-en/czeng-v0.9.xml
data/corpora/europarl-v6.xml
data/corpora/kr-en/samsung.xml
data/corpora/mono.xml
data/corpora/wmt-2011.xml
data/data.xml
data/lmodels/en/news-europarl-czeng.xml
data/lmodels/lmodels.xml
data/systems/cz-en.default/build.xml
data/systems/de-en.default/build.xml
data/systems/kr-en.default/build.xml
data/systems/systems.xml
src/cunei/cli/ProcessCorpus.java
src/cunei/corpus/MultiFileCorpusWriter.java
Added Paths:
-----------
bin/export-czeng.pl
|
|
From: <aar...@us...> - 2011-09-06 13:36:42
|
Revision: 792
http://cunei.svn.sourceforge.net/cunei/?rev=792&view=rev
Author: aaronbphillips
Date: 2011-09-06 13:36:36 +0000 (Tue, 06 Sep 2011)
Log Message:
-----------
Fixed bug in GUI where the source phrase was used instead of the target phrase
Modified Paths:
--------------
src/cunei/ui/WorkbenchWindow.java
|
|
From: <aar...@us...> - 2011-09-05 18:23:38
|
Revision: 791
http://cunei.svn.sourceforge.net/cunei/?rev=791&view=rev
Author: aaronbphillips
Date: 2011-09-05 18:23:32 +0000 (Mon, 05 Sep 2011)
Log Message:
-----------
Added option to compute the second derivative only along the diagonal. This is not as accurate, but it uses less memory and is much quicker.
Modified Paths:
--------------
src/cunei/model/SumLogLinearModel.java
src/cunei/translate/TranslationLattice.java
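
The trade-off described in this log message can be illustrated with a generic sketch. It is written in Python rather than the project's Java, and it is not taken from `SumLogLinearModel.java`; the function names and the finite-difference approach are assumptions made purely for illustration. A full matrix of second derivatives needs on the order of n² entries and evaluations, while the diagonal needs only n, at the cost of ignoring interactions between parameters.

```python
def full_hessian(f, x, h=1e-3):
    """Central-difference estimate of all n*n second derivatives of f at x."""
    n = len(x)
    hessian = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            def shifted(di, dj):
                y = list(x)
                y[i] += di
                y[j] += dj
                return f(y)
            hessian[i][j] = (shifted(h, h) - shifted(h, -h)
                             - shifted(-h, h) + shifted(-h, -h)) / (4 * h * h)
    return hessian


def diagonal_hessian(f, x, h=1e-3):
    """Estimate only d2f/dx_i2: n values instead of n*n."""
    fx = f(x)
    diagonal = []
    for i in range(len(x)):
        up = list(x)
        up[i] += h
        down = list(x)
        down[i] -= h
        diagonal.append((f(up) - 2 * fx + f(down)) / (h * h))
    return diagonal


def example_objective(w):
    # Hypothetical objective with an interaction term between w[0] and w[1].
    return w[0] ** 2 + 3 * w[0] * w[1] + 2 * w[1] ** 2
```

For `example_objective`, the diagonal estimate recovers the curvature along each parameter (about 2 and 4) but drops the off-diagonal interaction (about 3) that the full estimate retains, which is the sense in which the diagonal form is less accurate.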
|
|
From: <aar...@us...> - 2011-08-30 12:13:01
|
Revision: 790
http://cunei.svn.sourceforge.net/cunei/?rev=790&view=rev
Author: aaronbphillips
Date: 2011-08-30 12:12:55 +0000 (Tue, 30 Aug 2011)
Log Message:
-----------
Modified the pruning algorithm during optimization and set the default parameter to be a bit more aggressive.
Modified Paths:
--------------
src/cunei/cli/Optimize.java
src/cunei/evaluation/AbstractEvaluation.java
src/cunei/optimize/Optimizer.java
src/cunei/optimize/ProjectedEvaluation.java
|
|
From: <aar...@us...> - 2011-08-25 18:58:11
|
Revision: 789
http://cunei.svn.sourceforge.net/cunei/?rev=789&view=rev
Author: aaronbphillips
Date: 2011-08-25 18:58:05 +0000 (Thu, 25 Aug 2011)
Log Message:
-----------
Projection discounting is now disabled by default
Modified Paths:
--------------
src/cunei/cli/Optimize.java
src/cunei/model/SumLogLinearModel.java
|
|
From: <aar...@us...> - 2011-08-24 15:36:35
|
Revision: 788
http://cunei.svn.sourceforge.net/cunei/?rev=788&view=rev
Author: aaronbphillips
Date: 2011-08-24 15:36:29 +0000 (Wed, 24 Aug 2011)
Log Message:
-----------
Increased default parameter tolerance for convergence
Modified Paths:
--------------
src/cunei/cli/Optimize.java
src/cunei/optimize/Optimizer.java
|
|
From: <aar...@us...> - 2011-08-23 15:33:43
|
Revision: 787
http://cunei.svn.sourceforge.net/cunei/?rev=787&view=rev
Author: aaronbphillips
Date: 2011-08-23 15:33:37 +0000 (Tue, 23 Aug 2011)
Log Message:
-----------
Corrected distance projection gradient
Modified Paths:
--------------
src/cunei/model/SumLogLinearModel.java
src/cunei/optimize/Optimizer.java
|
|
From: <aar...@us...> - 2011-08-08 19:19:20
|
Revision: 786
http://cunei.svn.sourceforge.net/cunei/?rev=786&view=rev
Author: aaronbphillips
Date: 2011-08-08 19:19:14 +0000 (Mon, 08 Aug 2011)
Log Message:
-----------
Updated default parameters for SimilarityModel and related features
Modified Paths:
--------------
src/cunei/decode/SentenceModel.java
src/cunei/translate/CoverageModel.java
src/cunei/translate/Match.java
src/cunei/translate/SimilarityModel.java
|
|
From: <aar...@us...> - 2011-08-04 14:48:22
|
Revision: 785
http://cunei.svn.sourceforge.net/cunei/?rev=785&view=rev
Author: aaronbphillips
Date: 2011-08-04 14:48:16 +0000 (Thu, 04 Aug 2011)
Log Message:
-----------
Adjusted the default weights so features computed on the Lexical sequence have more impact than those for other sequence types
Modified Paths:
--------------
src/cunei/lexicon/Lexicons.java
src/cunei/translate/FrequencyModel.java
src/cunei/translate/Match.java
src/cunei/translate/SimilarityModel.java
src/cunei/translate/TypeModel.java
|
|
From: <aar...@us...> - 2011-08-03 20:26:44
|
Revision: 784
http://cunei.svn.sourceforge.net/cunei/?rev=784&view=rev
Author: aaronbphillips
Date: 2011-08-03 20:26:38 +0000 (Wed, 03 Aug 2011)
Log Message:
-----------
Changed divergence features. Minor edits to optimization.
Modified Paths:
--------------
src/cunei/cli/Optimize.java
src/cunei/optimize/Optimizer.java
src/cunei/translate/CoverageModel.java
src/cunei/translate/Match.java
|
|
From: <aar...@us...> - 2011-07-31 03:11:29
|
Revision: 783
http://cunei.svn.sourceforge.net/cunei/?rev=783&view=rev
Author: aaronbphillips
Date: 2011-07-31 03:11:23 +0000 (Sun, 31 Jul 2011)
Log Message:
-----------
The default values for most weights are now normalized by the number of sequence types known to the system. The SimilarityModel has been re-tooled and now includes a weighted F1 feature to help preserve content words.
Modified Paths:
--------------
src/cunei/alignment/PhraseAlignment.java
src/cunei/confusion/SynonymLattice.java
src/cunei/corpus/Context.java
src/cunei/corpus/MonolingualCorpus.java
src/cunei/corpus/MultilingualCorpus.java
src/cunei/hypothesis/ReferenceMatcher.java
src/cunei/hypothesis/ReferenceSimilarityModel.java
src/cunei/lexicon/Lexicons.java
src/cunei/model/SumLogLinearModel.java
src/cunei/optimize/Optimizer.java
src/cunei/translate/CoverageModel.java
src/cunei/translate/FrequencyModel.java
src/cunei/translate/Match.java
src/cunei/translate/SimilarityModel.java
src/cunei/translate/TranslationLattice.java
src/cunei/translate/TranslationSimilarityModel.java
src/cunei/translate/TypeModel.java
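
As a rough illustration of how a weighted F1 can favour content words, the sketch below scores the token overlap between a candidate and a reference, with each token contributing a weight instead of a count of one. It is written in Python and is not Cunei's `SimilarityModel`; the function names, the bag-of-words matching, and the stop-word weighting are all assumptions made for the example.

```python
from collections import Counter


def weighted_f1(candidate, reference, weight):
    """F1 over token overlap, with each token counted by weight(token)."""
    cand, ref = Counter(candidate), Counter(reference)
    overlap = cand & ref  # per-token minimum of the two counts

    def mass(counts):
        return sum(weight(token) * n for token, n in counts.items())

    matched = mass(overlap)
    if matched == 0:
        return 0.0
    precision = matched / mass(cand)
    recall = matched / mass(ref)
    return 2 * precision * recall / (precision + recall)


# Hypothetical weighting: function words count for much less than content words.
FUNCTION_WORDS = {"the", "a", "of", "on"}


def content_weight(token):
    return 0.1 if token in FUNCTION_WORDS else 1.0


reference = ["the", "cat", "sat"]
drops_function_word = weighted_f1(["cat", "sat"], reference, content_weight)
drops_content_word = weighted_f1(["the", "sat"], reference, content_weight)
```

Under this weighting, the candidate that loses "the" scores higher than the one that loses "cat", so a model using such a feature is pushed toward keeping content words.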
|
|
From: <aar...@us...> - 2011-07-27 20:18:30
|
Revision: 782
http://cunei.svn.sourceforge.net/cunei/?rev=782&view=rev
Author: aaronbphillips
Date: 2011-07-27 20:18:24 +0000 (Wed, 27 Jul 2011)
Log Message:
-----------
Updated Optimizer to use a distance limit based on the SumLogLinearModel projection distance instead of the movement limit that was calculated independently. After an evaluation, all projected models are now updated to be at least as bad as the worst log score. The BLEU metric was split into 1-Best and Expected variants, which are both added as default metrics.
Modified Paths:
--------------
src/cunei/cli/Optimize.java
src/cunei/evaluation/AbstractEvaluation.java
src/cunei/evaluation/BLEU.java
src/cunei/evaluation/Evaluation.java
src/cunei/evaluation/MultiExpectation.java
src/cunei/hypothesis/Segments.java
src/cunei/lexicon/CorpusLexicon.java
src/cunei/lexicon/ExternalLexicon.java
src/cunei/lm/BackoffLanguageModel.java
src/cunei/model/LogFeatureModel.java
src/cunei/model/LogLinearModel.java
src/cunei/model/SumLogLinearModel.java
src/cunei/optimize/Optimizer.java
src/cunei/optimize/ProjectedEvaluation.java
src/cunei/translate/MatchLattice.java
Added Paths:
-----------
src/cunei/evaluation/ExpectedBLEU.java
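
The difference between a 1-best and an expected metric can be sketched over an n-best list. The example below is in Python and is not the code in `BLEU.java` or `ExpectedBLEU.java`. It assumes each hypothesis carries a model log score and a single quality number standing in for its BLEU contribution (real BLEU is computed over a whole corpus), and it assumes the expectation is taken under a softmax of the model scores, which is one common formulation.

```python
import math


def one_best(nbest):
    """Quality of the single hypothesis with the highest model score."""
    return max(nbest, key=lambda hyp: hyp[0])[1]


def expected(nbest):
    """Quality averaged under the softmax distribution of model scores."""
    top = max(score for score, _ in nbest)
    # Subtracting the top score keeps exp() numerically stable.
    weights = [math.exp(score - top) for score, _ in nbest]
    total = sum(weights)
    return sum(w * quality for w, (_, quality) in zip(weights, nbest)) / total


# Hypothetical n-best list of (model log score, quality) pairs.
nbest = [(-1.0, 0.30), (-1.5, 0.50), (-3.0, 0.10)]
top_quality = one_best(nbest)
mean_quality = expected(nbest)
```

The 1-best value changes only when the top-ranked hypothesis changes, whereas the expected value moves smoothly as the scores shift, which is why the latter is often the more convenient objective for gradient-based optimization.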
|
|
From: <aar...@us...> - 2011-07-24 12:07:14
|
Revision: 781
http://cunei.svn.sourceforge.net/cunei/?rev=781&view=rev
Author: aaronbphillips
Date: 2011-07-24 12:07:08 +0000 (Sun, 24 Jul 2011)
Log Message:
-----------
Fixed problem in FrequencyModel with translations that do not occur in the corpus. Several minor changes to Optimizer that (hopefully) improve stability. There is still some excess debugging output in Optimizer that should be removed.
Modified Paths:
--------------
src/cunei/evaluation/AbstractEvaluation.java
src/cunei/evaluation/Evaluation.java
src/cunei/optimize/Optimizer.java
src/cunei/optimize/ProjectedEvaluation.java
src/cunei/translate/FrequencyModel.java
|
|
From: <aar...@us...> - 2011-07-22 19:07:31
|
Revision: 780
http://cunei.svn.sourceforge.net/cunei/?rev=780&view=rev
Author: aaronbphillips
Date: 2011-07-22 19:07:25 +0000 (Fri, 22 Jul 2011)
Log Message:
-----------
Minor modifications to Lexicon scoring to avoid recomputing the typeIds.
Modified Paths:
--------------
src/cunei/corpus/MultilingualCorpus.java
src/cunei/lexicon/CorpusLexicon.java
src/cunei/lexicon/ExternalLexicon.java
src/cunei/lexicon/Lexicon.java
src/cunei/lexicon/Lexicons.java
src/cunei/translate/FrequencyModel.java
|
|
From: <aar...@us...> - 2011-07-21 22:14:43
|
Revision: 779
http://cunei.svn.sourceforge.net/cunei/?rev=779&view=rev
Author: aaronbphillips
Date: 2011-07-21 22:14:37 +0000 (Thu, 21 Jul 2011)
Log Message:
-----------
New frequency model. In addition, the frequency, lexicon, and context features are all normalized by the total number of type sequences.
Modified Paths:
--------------
src/cunei/corpus/MonolingualCorpus.java
src/cunei/corpus/SequenceIndex.java
src/cunei/lexicon/Lexicons.java
src/cunei/translate/FrequencyModel.java
src/cunei/translate/Match.java
src/cunei/translate/MatchLattice.java
src/cunei/translate/TranslationLattice.java
src/cunei/util/IntegerBoundIndex.java
|
|
From: <aar...@us...> - 2011-07-20 16:17:24
|
Revision: 778
http://cunei.svn.sourceforge.net/cunei/?rev=778&view=rev
Author: aaronbphillips
Date: 2011-07-20 16:17:18 +0000 (Wed, 20 Jul 2011)
Log Message:
-----------
Removed sessionId from Type to reduce memory usage.
Modified Paths:
--------------
src/cunei/cli/Frequencies.java
src/cunei/cli/IndexLexicon.java
src/cunei/corpus/MonolingualCorpus.java
src/cunei/corpus/MultilingualCorpus.java
src/cunei/corpus/SequenceIndex.java
src/cunei/corpus/SequenceIndexBuilder.java
src/cunei/lexicon/Lexicon.java
src/cunei/lexicon/LexiconSerializer.java
src/cunei/lexicon/Lexicons.java
src/cunei/sa/SuffixArray.java
src/cunei/sa/UnorderedSuffixArrayComparator.java
src/cunei/type/Annotation.java
src/cunei/type/Type.java
src/cunei/type/TypeIndex.java
src/cunei/type/TypeSequence.java
src/cunei/type/UncasedString.java
Added Paths:
-----------
src/cunei/lexicon/CorpusLexicon.java
src/cunei/lexicon/ExternalLexicon.java
Removed Paths:
-------------
src/cunei/type/SessionTypeIndex.java
src/cunei/util/TypeBoundIndex.java
|
|
From: <aar...@us...> - 2011-07-19 16:02:05
|
Revision: 777
http://cunei.svn.sourceforge.net/cunei/?rev=777&view=rev
Author: aaronbphillips
Date: 2011-07-19 16:01:59 +0000 (Tue, 19 Jul 2011)
Log Message:
-----------
Reduce default size of arrays in PositionLocator
Modified Paths:
--------------
src/cunei/corpus/PositionLocator.java
|