The interactive file manager requires Javascript. Please enable it or use sftp or scp.
You may still browse the files here.

Download Latest Version KanjiWordAssociationTool_v2.2_Source_Code.7z (3.2 MB)
Email in envelope

Get an email when there's a new version of Kanji Word Association Tool

Home
Name Modified Size InfoDownloads / Week
KanjiWordAssociationTool_v2.2 2013-10-27
KanjiWordAssociationTool_v2.1 2013-07-20
KanjiWordAssociationTool_v2.0 2013-07-14
DictionaryGenerator 2013-07-13
KanjiWordAssociationTool_v1.0 2013-06-30
readme.txt 2013-10-27 7.1 kB
Totals: 6 Items   7.1 kB 1
Kanji Word Association Tool
---------------------------

Description:
------------
This tool was created for students who want to learn kanji and words at the same time in the most
optimal fashion possible. Based on a user-provided list of kanji, this tool will generate a list of
words that are associated with each kanji and ensure that each word consists only of kanji that you
have already studied up to that point and kana. In addition, words are sorted by frequency and no
duplicate words are used.

For example, assume that 径 is the 882nd kanji in the user-provided kanji list and that we are
using the default options. The words that will be generated for 径 will only contain kanji from the
1st kanji in the list to the 882nd kanji in the list.


How to Install/Launch (Windows Users):
--------------------------------------
1) Unzip Kanji Word Association Tool.
2) In the unzipped directory, simply double-click JapaneseTextAnalysisTool.exe.
   Note: Requires .Net Framework 3.5.


Options:
--------

* File that contains the list of kanji (one kanji per line):

  The user-provided list of kanji. If a line contains multiple tab-separated columns, you may
  specify which column contains the kanji. Several kanji lists come pre-packaged in the
  "Sample_Kanji_Lists" directory:

    kanji_in_jtat_novel_frequency_order.txt - Kanji are listed based on Japanese Text Analysis Tool's
                                              frequency report for a corpus of 5000+ Japanese novels.
    kanji_in_kanjidic_frequency_order.txt   - Kanji are listed based on KANJIDIC frequency.
    rtk_1_and_3_kanji.txt                   - Kanji are listed in Remembering the Kanji 1 and 3 order.
    rtk_1_kanji_only.txt                    - Kanji are listed in Remembering the Kanji 1 order.
    rtk_lite.txt                            - Kanji are listed on the user community's "lite"
                                              version of Remembering the Kanji 1.

* Look-ahead range:

  When determining which entries to associate with the current kanji being processed, one of the
  following look-ahead range options is used, where N is the number of the current kanji being
  processed in the kanji list, and R is a value that is specified by the user:

  - Don't lookahead (default). Range: [1 to N].
    Only associate the current kanji being processed with words that contain a kanji that has already
    been encountered. Example: For N = 367, the range would be [1-367].

  - Dynamic. Range: [1 to N + R].
    Associate the current kanji being processed with any words that contain a kanji that has already
    been encountered or with a kanji that is the user-specified number of kanji ahead of the current
    kanji. Example: For N = 367 and R = 100, the range would be [1 to 467].

  - Step. Range: [1 to get_next_multiple(N, R)].
    Associate the current kanji being processed with any words that contain a kanji that has already
    been encountered or with a kanji that is within the next multiple of R. Example:
    For N = 367 and R = 100, the range would be [1 to 400].

* [optional] File that contains a list of words that you already know (one word per line):

  Words in this list will not be added to the output file. If a line contains multiple tab-separated
  columns, you may specify which column contains the word

* Don't associate words that consist of only a single kanji

  Example: For the kanji 旭, ignore the entry "旭 (あさひ) morning sun" because the expression is
  only a single kanji.

* Maximum number of entries to associate with each kanji:

  Hopefully self-explanatory.

* Entry format:

  The format of each entry in the output file using these tokens:
    Definition  %d    Reading  %r
    Expression  %e    Tab      %t
    Frequency   %f

  Example: %e%t%r%t%f%t%d tells this tool to format each entry like this:
           Expression[tab]Reading[tab]Frequency[tab]Definition

* Add kanji to the output file

  Example: If 調 is the current kanji being processed, add 調 to the output file in a separate column.

* Add kanji number to the output file

  Kanji number is the nth position of the kanji in the kanji list input file.

  Example: Assume that 調 is the 349th kanji in the kanji list file. When 調 is processed and this
  option is checked, the number 349 will be added to a separate column in the output file.

* Output file format:

  The format of the output file. There are two choices:
    Place all entries on a single line
    Place entries on separate lines

* Output directory:

  Directory where the output file(s) will be placed. There are 2 output files:
    1) kwat_kanji_words.tsv - Contains list of kanji and associated entries.
    2) kwat_kanji_without_associations.txt - Contains list of kanji that have no associated entries.

Advanced Options:
-----------------

* Dictionary file

  Specify the dict.dat file to use. You can create your own dict.dat file with the
  Kanji Word Association Tool Dictionary Generator:
  https://sourceforge.net/projects/kanjiwordassoci/files/DictionaryGenerator/
  
* Allow associations to entries consisting of any kanji (not just those already encountered or
  those in the user's kanji list)
  
  Warning: checking this option defeats the primary intent of this program, that is, to only 
  generate words based on the kanji that the user already knows.
    
* Output a list of kanji in the user's kanji list that were not used by any of the associated entries

  This output file is named "kwat_unused_kanji.txt".


Contact:
--------
Christopher Brochtrup
cb4960@gmail.com


Special Thanks:
---------------
nest0r
louischa

--------------------------------------------------------------------------------

Version History:
----------------

[Version 2.2 (27-October-2013)]
- Updated dict.dat.

[Version 2.1 (19-July-2013)]
- Added advanced option: "Allow associations to entries consisting of any kanji (not just
  those already encountered or those in the user's kanji list)"
- Added advanced option: "Output a list of kanji in the user's kanji list that were not used
  by any of the associated entries"
- Fixed bug that prevented entries without frequency from being used for associations.
- Fixed bug where the selection of the dictionary file (in Advanced Options) could only be
  performed once.

[Version 2.0 (13-July-2013)]
- Added column selectors to the kanji list and known words files.
- Placeholders are no longer generated when "Place all entries on a single line" is selected.
- Added "kwat_kanji_without_associations.txt" output file.
- Added "Advanced options..." link.
- Added "Advanced Options" dialog.
- Added advanced option to specify which dictionary file to use.
- Changed "Output file" to "Output directory".
- The output file is now named "kwat_kanji_words.txt" and cannot be changed by the user.

[Version 1.0 (29-June-2013)]
- Initial version.
Source: readme.txt, updated 2013-10-27