cunei-commits Mailing List for Cunei Machine Translation Platform (Page 33)
Status: Beta
Brought to you by:
aaronbphillips
You can subscribe to this list here.
| 2009 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
(96) |
Jul
(161) |
Aug
(12) |
Sep
(18) |
Oct
(17) |
Nov
(6) |
Dec
(8) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2010 |
Jan
(28) |
Feb
(18) |
Mar
(26) |
Apr
(37) |
May
(37) |
Jun
(10) |
Jul
(5) |
Aug
(41) |
Sep
(11) |
Oct
(32) |
Nov
(30) |
Dec
(28) |
| 2011 |
Jan
(29) |
Feb
(16) |
Mar
(23) |
Apr
(40) |
May
(15) |
Jun
(14) |
Jul
(24) |
Aug
(7) |
Sep
(16) |
Oct
(4) |
Nov
(12) |
Dec
|
| 2012 |
Jan
|
Feb
(4) |
Mar
|
Apr
|
May
|
Jun
|
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
|
From: <aar...@us...> - 2009-06-18 16:28:00
|
Revision: 26
http://cunei.svn.sourceforge.net/cunei/?rev=26&view=rev
Author: aaronbphillips
Date: 2009-06-18 16:27:51 +0000 (Thu, 18 Jun 2009)
Log Message:
-----------
Added new class ManagedLongBuffer that creates long-aligned arrays that can be memory-mapped. Lexicon induction now uses a large UnsignedHash and does one pass over each TypeSequence in the corpus.
Modified Paths:
--------------
src/cunei/bits/BitBuffer.java
src/cunei/bits/ManagedBuffer.java
src/cunei/bits/ManagedIntBuffer.java
src/cunei/bits/MemoryMappedBitBuffer.java
src/cunei/bits/MemoryMappedIntBuffer.java
src/cunei/bits/ResizableUnsignedHash.java
src/cunei/bits/UnsignedArray.java
src/cunei/bits/UnsignedHash.java
src/cunei/corpus/CorpusReader.java
src/cunei/corpus/MultilingualCorpus.java
src/cunei/sort/Sorter.java
src/cunei/type/TypeIndex.java
Added Paths:
-----------
src/cunei/bits/ManagedLongBuffer.java
src/cunei/bits/ManagedNIOBuffer.java
src/cunei/bits/MemoryMappedLongBuffer.java
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <aar...@us...> - 2009-06-17 21:47:50
|
Revision: 25
http://cunei.svn.sourceforge.net/cunei/?rev=25&view=rev
Author: aaronbphillips
Date: 2009-06-17 21:47:29 +0000 (Wed, 17 Jun 2009)
Log Message:
-----------
Reading and writing MultilingualCorpus is now working. Some of the corpus output files have been renamed (but re-indexing is required anyways). Additionally, a minor bug was fixed in ManagedBitBuffer to allow for a zero length array.
Modified Paths:
--------------
src/cunei/alignment/AlignmentIndex.java
src/cunei/bits/ManagedBitBuffer.java
src/cunei/corpus/CorpusInformation.java
src/cunei/corpus/MonolingualCorpus.java
src/cunei/corpus/MultilingualCorpus.java
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <aar...@us...> - 2009-06-17 20:47:54
|
Revision: 24
http://cunei.svn.sourceforge.net/cunei/?rev=24&view=rev
Author: aaronbphillips
Date: 2009-06-17 20:47:52 +0000 (Wed, 17 Jun 2009)
Log Message:
-----------
BROKEN: Replaced Corpus with MultilingualCorpus. Re-indexing will be required. Code currently compiles but there are still some issues with indexing.
Modified Paths:
--------------
src/cunei/cli/Decode.java
src/cunei/cli/EstimateLexiconAlignment.java
src/cunei/cli/EstimateSentenceRatios.java
src/cunei/cli/Evaluate.java
src/cunei/cli/IndexCorpus.java
src/cunei/cli/IndexPanliteCorpus.java
src/cunei/cli/Optimize.java
src/cunei/cli/ProcessPanliteCorpus.java
src/cunei/cli/Translate.java
src/cunei/cli/Unknown.java
src/cunei/corpus/Context.java
src/cunei/corpus/CorpusInformation.java
src/cunei/corpus/CorpusSerializer.java
src/cunei/corpus/MonolingualCorpus.java
src/cunei/corpus/Origins.java
src/cunei/optimize/Optimizer.java
src/cunei/translate/Hypothesis.java
src/cunei/translate/Match.java
src/cunei/translate/PhraseModel.java
src/cunei/translate/Similarity.java
src/cunei/translate/Translator.java
src/cunei/ui/AlignmentWindow.java
src/cunei/ui/Workbench.java
Added Paths:
-----------
src/cunei/corpus/MultilingualCorpus.java
Removed Paths:
-------------
src/cunei/corpus/Corpus.java
src/cunei/corpus/MultilingualCorpus.java
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <ral...@us...> - 2009-06-17 19:45:20
|
Revision: 23
http://cunei.svn.sourceforge.net/cunei/?rev=23&view=rev
Author: ralfbrown
Date: 2009-06-17 19:45:16 +0000 (Wed, 17 Jun 2009)
Log Message:
-----------
Fixed unused-X warnings from Eclipse, made MultilingualCorpus a derivative of CorpusInformation instead of having an instance of it as a member to eliminate a bunch of wrapper functions which simply called through to the embedded CorpusInformation instance.
Modified Paths:
--------------
src/cunei/corpus/CorpusInformation.java
src/cunei/corpus/MonolingualCorpus.java
src/cunei/corpus/MultilingualCorpus.java
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <ral...@us...> - 2009-06-17 16:50:08
|
Revision: 22
http://cunei.svn.sourceforge.net/cunei/?rev=22&view=rev
Author: ralfbrown
Date: 2009-06-17 16:49:54 +0000 (Wed, 17 Jun 2009)
Log Message:
-----------
Split class Corpus into MonolingualCorpus, MultilingualCorpus, and CorpusInformation, and added alternate version of Origins.mergeExampleFeatures
Modified Paths:
--------------
src/cunei/corpus/Origins.java
Added Paths:
-----------
src/cunei/corpus/CorpusInformation.java
src/cunei/corpus/MonolingualCorpus.java
src/cunei/corpus/MultilingualCorpus.java
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <aar...@us...> - 2009-06-17 15:29:52
|
Revision: 21
http://cunei.svn.sourceforge.net/cunei/?rev=21&view=rev
Author: aaronbphillips
Date: 2009-06-17 15:29:51 +0000 (Wed, 17 Jun 2009)
Log Message:
-----------
Alignment probabilities set by the lexicon induction are now the product of all lexicons (previously it was a summation which produced probabilities greater than 1).
Modified Paths:
--------------
src/cunei/corpus/Corpus.java
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <aar...@us...> - 2009-06-17 14:28:07
|
Revision: 20
http://cunei.svn.sourceforge.net/cunei/?rev=20&view=rev
Author: aaronbphillips
Date: 2009-06-17 14:28:04 +0000 (Wed, 17 Jun 2009)
Log Message:
-----------
Added simple user interface for viewing corpus alignments
Modified Paths:
--------------
launch/Translate.launch
src/cunei/cli/CommandLineInterface.java
src/cunei/corpus/Corpus.java
Added Paths:
-----------
src/cunei/ui/
src/cunei/ui/AlignmentWindow.java
src/cunei/ui/Workbench.java
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <aar...@us...> - 2009-06-17 14:27:20
|
Revision: 19
http://cunei.svn.sourceforge.net/cunei/?rev=19&view=rev
Author: aaronbphillips
Date: 2009-06-17 14:27:17 +0000 (Wed, 17 Jun 2009)
Log Message:
-----------
Fixed bug in corpus processing where load flag was not handled properly
Modified Paths:
--------------
src/cunei/cli/ProcessCorpus.java
src/cunei/cli/ProcessPanliteCorpus.java
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <aar...@us...> - 2009-06-16 14:44:57
|
Revision: 18
http://cunei.svn.sourceforge.net/cunei/?rev=18&view=rev
Author: aaronbphillips
Date: 2009-06-16 14:44:47 +0000 (Tue, 16 Jun 2009)
Log Message:
-----------
Fixed bug in source and target length of gapped alignment chunks. Previously, unknown words were not included in the calculation.
Modified Paths:
--------------
src/cunei/alignment/SubPhraseAlignment.java
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <aar...@us...> - 2009-06-15 21:21:20
|
Revision: 17
http://cunei.svn.sourceforge.net/cunei/?rev=17&view=rev
Author: aaronbphillips
Date: 2009-06-15 21:21:19 +0000 (Mon, 15 Jun 2009)
Log Message:
-----------
Added class to handle European numbers that reverse commas and periods
Modified Paths:
--------------
launch/Translate.launch
src/cunei/processors/ReverseSourceNumbers.java
systems/fr-en/default/config
systems/ur-en/default/config
Added Paths:
-----------
src/cunei/processors/EuropeanNumbers.java
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <aar...@us...> - 2009-06-15 17:23:31
|
Revision: 16
http://cunei.svn.sourceforge.net/cunei/?rev=16&view=rev
Author: aaronbphillips
Date: 2009-06-15 17:23:16 +0000 (Mon, 15 Jun 2009)
Log Message:
-----------
Fixed bug in release script so release version is correctly saved to RELEASE
Modified Paths:
--------------
bin/make-release.sh
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <aar...@us...> - 2009-06-15 15:32:19
|
Revision: 15
http://cunei.svn.sourceforge.net/cunei/?rev=15&view=rev
Author: aaronbphillips
Date: 2009-06-15 15:32:17 +0000 (Mon, 15 Jun 2009)
Log Message:
-----------
Release
Modified Paths:
--------------
.project
bin/cunei.sh
bin/make-release.sh
bin/update.sh
build.xml
launch/Translate.launch
Added Paths:
-----------
RELEASE
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <aar...@us...> - 2009-06-15 15:06:09
|
Revision: 14
http://cunei.svn.sourceforge.net/cunei/?rev=14&view=rev
Author: aaronbphillips
Date: 2009-06-15 15:05:59 +0000 (Mon, 15 Jun 2009)
Log Message:
-----------
Updated helper scripts to work with a jar file
Modified Paths:
--------------
bin/cunei.sh
Added Paths:
-----------
bin/make-release.sh
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <aar...@us...> - 2009-06-11 14:07:34
|
Revision: 13
http://cunei.svn.sourceforge.net/cunei/?rev=13&view=rev
Author: aaronbphillips
Date: 2009-06-11 14:07:13 +0000 (Thu, 11 Jun 2009)
Log Message:
-----------
Renamed previous XMLDecoderWriter to RosettaDecoderWriter and created a new XMLDecoderWriter whose format is more human-readable and displays all available data in each Hypothesis.
Modified Paths:
--------------
src/cunei/decode/DecoderWriters.java
src/cunei/decode/XMLDecoderWriter.java
src/cunei/translate/Translation.java
Added Paths:
-----------
src/cunei/decode/RosettaDecoderWriter.java
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <aar...@us...> - 2009-06-11 04:45:16
|
Revision: 12
http://cunei.svn.sourceforge.net/cunei/?rev=12&view=rev
Author: aaronbphillips
Date: 2009-06-11 04:44:15 +0000 (Thu, 11 Jun 2009)
Log Message:
-----------
Fixed bug in ConfusionPath that produced a backward source Phrase.
Modified Paths:
--------------
src/cunei/confusion/ConfusionPath.java
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <aar...@us...> - 2009-06-11 03:45:58
|
Revision: 11
http://cunei.svn.sourceforge.net/cunei/?rev=11&view=rev
Author: aaronbphillips
Date: 2009-06-11 03:45:13 +0000 (Thu, 11 Jun 2009)
Log Message:
-----------
Fixed XMLConfusionReader to properly read 'id' attribute in 'sentence' tag.
Modified Paths:
--------------
src/cunei/config/ClassConfiguration.java
src/cunei/confusion/XMLConfusionReader.java
src/cunei/translate/Translator.java
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <aar...@us...> - 2009-06-10 15:46:19
|
Revision: 10
http://cunei.svn.sourceforge.net/cunei/?rev=10&view=rev
Author: aaronbphillips
Date: 2009-06-10 15:46:11 +0000 (Wed, 10 Jun 2009)
Log Message:
-----------
Modified alignment utility script to handle compressed files. Fixed language model utility script to properly reference megabytes.
Modified Paths:
--------------
bin/align-giza.sh
bin/build-lm.sh
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <aar...@us...> - 2009-06-10 14:52:32
|
Revision: 9
http://cunei.svn.sourceforge.net/cunei/?rev=9&view=rev
Author: aaronbphillips
Date: 2009-06-10 13:51:21 +0000 (Wed, 10 Jun 2009)
Log Message:
-----------
MultiFileCorpusWriter now properly closes file handles when it completes.
Modified Paths:
--------------
src/cunei/corpus/MultiFileCorpusWriter.java
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <aar...@us...> - 2009-06-10 14:50:21
|
Revision: 8
http://cunei.svn.sourceforge.net/cunei/?rev=8&view=rev
Author: aaronbphillips
Date: 2009-06-10 13:38:09 +0000 (Wed, 10 Jun 2009)
Log Message:
-----------
The BufferedReaderManager will only attempt to load a compressed GZIP stream if the file name ends in '.gz'. Likewise, the PrintStreamWriter now writes a compressed GZIP stream when the file name ends in '.gz'.
Modified Paths:
--------------
src/cunei/util/BufferedReaderManager.java
src/cunei/util/PrintStreamManager.java
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <aar...@us...> - 2009-06-10 13:15:33
|
Revision: 7
http://cunei.svn.sourceforge.net/cunei/?rev=7&view=rev
Author: aaronbphillips
Date: 2009-06-10 13:15:32 +0000 (Wed, 10 Jun 2009)
Log Message:
-----------
Updated default Urdu-English configuration
Modified Paths:
--------------
systems/ur-en/default/config
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <aar...@us...> - 2009-06-10 02:53:34
|
Revision: 6
http://cunei.svn.sourceforge.net/cunei/?rev=6&view=rev
Author: aaronbphillips
Date: 2009-06-10 00:41:33 +0000 (Wed, 10 Jun 2009)
Log Message:
-----------
Added a DocumentReaders class and modified command line Process* utilities to allow for reading in different types of documents. Changed corpus Process* utilities to allow for loading of the indexed corpus.
Modified Paths:
--------------
src/cunei/cli/ProcessCorpus.java
src/cunei/cli/ProcessDocument.java
Added Paths:
-----------
src/cunei/cli/ProcessPanliteCorpus.java
src/cunei/document/DocumentReaders.java
Removed Paths:
-------------
src/cunei/cli/ExportPanliteCorpus.java
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <aar...@us...> - 2009-06-10 02:41:02
|
Revision: 5
http://cunei.svn.sourceforge.net/cunei/?rev=5&view=rev
Author: aaronbphillips
Date: 2009-06-10 00:33:13 +0000 (Wed, 10 Jun 2009)
Log Message:
-----------
Changed TypesOfTypes so it throws an IllegalArgumentException instead of a generic RuntimeExeption when the type name is unknown. Modified XMLConfusionReader so that it does not die if extra type information is specified.
Modified Paths:
--------------
src/cunei/confusion/XMLConfusionReader.java
src/cunei/type/TypesOfTypes.java
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <aar...@us...> - 2009-06-06 03:50:08
|
Revision: 4
http://cunei.svn.sourceforge.net/cunei/?rev=4&view=rev
Author: aaronbphillips
Date: 2009-06-06 03:50:07 +0000 (Sat, 06 Jun 2009)
Log Message:
-----------
Fixed several bugs in gapped matching. Sentences seem to decode without errors now, but without any target constraints (and likely poor feature weights) I am seeing a *lot* of gaps being filled with an epsilon translation.
Modified Paths:
--------------
src/cunei/alignment/PhraseAlignment.java
src/cunei/decode/ChartHypothesisBuilder.java
src/cunei/decode/Decoder.java
src/cunei/document/Phrase.java
src/cunei/lattice/ScoredSet.java
src/cunei/lm/Ngram.java
src/cunei/processors/ReverseSourceNumbers.java
src/cunei/translate/Hypothesis.java
src/cunei/translate/Match.java
src/cunei/translate/Similarity.java
src/cunei/translate/Translation.java
src/cunei/translate/Translator.java
src/cunei/type/TypeSequence.java
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <aar...@us...> - 2009-06-05 22:36:42
|
Revision: 3
http://cunei.svn.sourceforge.net/cunei/?rev=3&view=rev
Author: aaronbphillips
Date: 2009-06-05 22:36:40 +0000 (Fri, 05 Jun 2009)
Log Message:
-----------
Removed InputPhrase and instead passed around input with a ConfusionPath. It may be worthwhile to revisit this decision later. Gaps are now being created throughout the Translator, but they are always unconstrained (target phrase is null). Still need to verify this works with the Decoder and apply constraints to the gaps.
Modified Paths:
--------------
launch/Decode.launch
src/cunei/confusion/ConfusionPath.java
src/cunei/decode/ChartHypothesisBuilder.java
src/cunei/document/Phrase.java
src/cunei/translate/Match.java
src/cunei/translate/Similarity.java
src/cunei/translate/Translation.java
src/cunei/type/TypeSequence.java
Removed Paths:
-------------
src/cunei/confusion/InputPhrase.java
Property Changed:
----------------
/
data/
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|
|
From: <aar...@us...> - 2009-06-05 16:24:33
|
Revision: 2
http://cunei.svn.sourceforge.net/cunei/?rev=2&view=rev
Author: aaronbphillips
Date: 2009-06-05 16:24:26 +0000 (Fri, 05 Jun 2009)
Log Message:
-----------
Added InputPhrase class which extends the Phrase class to provide coverage information. The InputPhrase will be used by the ConfusionNode, Similarity, and Translation classes as a unified mechanism for tracking the input.
Modified Paths:
--------------
src/cunei/alignment/PhraseAlignment.java
src/cunei/cli/ScoreLanguageModel.java
src/cunei/confusion/DocumentConfusionReader.java
src/cunei/corpus/Context.java
src/cunei/corpus/Corpus.java
src/cunei/corpus/CorpusIndexBuilder.java
src/cunei/corpus/MultiFileCorpusReader.java
src/cunei/decode/HypothesisBuilder.java
src/cunei/document/Phrase.java
src/cunei/lexicon/Lexicons.java
src/cunei/lm/BackoffLanguageModel.java
src/cunei/processors/SentenceEliminator.java
src/cunei/translate/Match.java
src/cunei/translate/PhraseModel.java
src/cunei/translate/Similarity.java
src/cunei/translate/Translator.java
src/cunei/type/TypeSequence.java
Added Paths:
-----------
src/cunei/confusion/InputPhrase.java
This was sent by the SourceForge.net collaborative development platform, the world's largest Open Source development site.
|