2025-06-10 TANAKA Takuji * fwrite.c: Better treatment of Ŋ,ŋ in Finnish. Better treatment of Þ,þ in Danish, Norwegian, Swedish and Finnish. Better treatment of Ę,ę in Norwegian and Swedish. Better treatment of Ð,ð,Đ,đ in Danish, Norwegian, Finnish, Swedish and Icelandic. Better treatment of Ü,ü,Ű,ű in Danish, Norwegian, Swedish and Finnish. Better treatment of Ö,ö,Ø,ø,Ő,ő,Ô,ô,Œ,œ,Æ,æ,Ä,ä in Swedish. 2025-06-01 TANAKA Takuji * fwrite.c: Support digraph AA and better treatment of Ø,ø,Ö,ö,Æ,æ,Ä,ä in Danish and Norwegian. Better treatment of Ø,ø,Ö,ö,Æ,æ,Ä,ä in Icelandic. Better treatment of Ö,ö,Ø,ø,Ő,ő,Õ,õ,Ü,ü,Ű,ű,Œ,œ,Æ,æ,Ä,ä in Finnish. 2025-05-29 TANAKA Takuji * version 1.20 Stable version. * configure.ac: Bump version. * main.c, sort.c, convert.c, fwrite.c, styfile.c, kana.h, {,ex}var.h, mendex.h: Support Brahmic (Bengali, Gurmukhi, Gujarati, Kannada, Lao, Malayalam, Oriya, Sinhala, Tamil and Telugu) scripts (experimental). Add options "script_head" in style file. https://github.com/t-tk/upmendex-package/issues/14 * styfile.c: Remove deprecated keyword "tumunja" from style file. * fwrite.c: Enhance treatment of S,T comma below in Romanian. Support V-W ordering in Finnish traditional collation. * sort.c: Prepare to support CJK Unified Ideographs Extension J in Unicode 17.0. It will work with future version of ICU. Treat U+3192..3195, U+3280..3289, U+A830..A835 as "other number". 2025-03-07 Karl Berry * TL'25 release. 2025-02-19 TANAKA Takuji * convert.c, kp.{c,h}, styfile.c: Use kpse_find_file to find style files. Thanks to Yukimasa Morimi. * tests/upmendex.test: Add tests. https://github.com/texjporg/tex-jp-build/issues/175 2025-02-10 TANAKA Takuji * version 1.11 Stable version. * configure.ac: Bump version. * fread.c: Improve compatibility with makeindex. * styfile.c: Fix escaping of string parameters. Thanks to Yukimasa Morimi. https://github.com/texjporg/tex-jp-build/issues/173 * tests/upmendex.test, tests/head1.ist, tests/tort{,W}.idx, tests/ok-tort{,2}.ind: Add tests for encap escape & heading_{pre,post}fix. * Makefile.am: Adjust. 2025-01-25 Karl Berry * COPYRIGHT, * main.c: 2025. 2024-11-09 TANAKA Takuji * version 1.10 Stable version. * configure.ac: Bump version. * {mendex,var,exvar}.h, fread.c, fwrite.c, sort.c, styfile.c: Support for indices with four sublevels, conventionally two sublevels. Thanks to pull request by Sergei Golovan. https://github.com/t-tk/upmendex-package/pull/9 * fread.c: Improve error message when reading index entries. https://github.com/t-tk/upmendex-package/issues/13 * sort.c: Extend RULEBUFSIZE to adjust ICU 76.1. Revise to avoid compiler warnings. * convert.c: Tweak comment. * tests/{upmendex,pprec,icu}.test: Split test scripts. * tests/nest.test, tests/nested{,3}.{ist,idx,ind}, tests/toodeep.idx: Add new tests for nested depth. * tests/upmendex.test, tests/tort{,W}.idx, tests/ok-tort.ind: Add new tests for error/warning. * Makefile.am: Adjust. 2024-07-06 TANAKA Takuji * version 1.09 Stable version. * configure.ac: Bump version. * fwrite.c: Improve heading of Ö,ö,Ő,ő,Ü,ü,Ű,ű in Hungarian. Thanks to @hair-splitter for the report. https://github.com/t-tk/upmendex-package/issues/10 2024-03-10 Karl Berry * TL'24 release. 2024-02-05 Karl Berry * COPYRIGHT, * main.c: 2024. 2023-12-16 TANAKA Takuji * version 1.08 Stable version. * configure.ac: Bump version. * sort.c: Warn if remaining buffer is small for custom collation rules when "icu_locale" and "icu_rules" are specified in style file. Add CJK Unified Ideograph Extension I in Unicode 15.1. It works with ICU 74.1 or later. * tests/upmendex.test, tests/locale3.ist, tests/locale3-rule.ind, Makefile.am: Update tests. 2023-08-11 TANAKA Takuji * tests/upmendex.test: Make easier to test on Windows. 2023-07-05 TANAKA Takuji * fwrite.c, mendex.h: Enable to build with system ICU DLL icu.dll for Windows 10 version 1903 (May 2019 Update) or later. https://learn.microsoft.com/en-us/windows/win32/intl/international-components-for-unicode--icu- 2023-03-09 Karl Berry * TL'23 release. 2023-02-26 TANAKA Takuji * hanzi.h: Update references. 2022-11-20 TANAKA Takuji * version 1.07 Stable version. * configure.ac: Bump version. * sort.c, fwrite.c, {,ex}hanzi.h: Improve detection of hanzi header to support Chinese polyphone (多音字). 2022-11-16 TANAKA Takuji * version 1.06 Stable version. * configure.ac: Bump version. * sort.c: Add parameter option "@" to character_order, which denotes character order is defined by icu_rules. * fwrite.c: Better treatment for Marathi KSSA, JNYA. * upmendex.ja.txt: Update document. 2022-10-08 TANAKA Takuji * version 1.05 Stable version. * configure.ac: Bump version. * main.c, sort.c, convert.c, fwrite.c, {,ex}kana.h, mendex.h: Support U+1B001 Hiragana Letter Archaic Ye. * fwrite.c: Strict check for dotted/dotless I/i in Turkish. Strict check for Thai reordering. 2022-09-17 TANAKA Takuji * fwrite.c: Fix a bug on Thai words started with vowel followed by consonant. 2022-09-11 TANAKA Takuji * version 1.03 Stable version. * configure.ac: Bump version. * main.c, sort.c, convert.c, fwrite.c, kana.h: Support U+1B11F..1B122 Hiragana/Katakana Letter Archaic Wu, Yi, Ye. 2022-09-04 TANAKA Takuji * version 1.02 Stable version. * configure.ac: Bump version. * fwrite.c: Support U+1B132, U+1B155 Hiragana/Katakana Letter Small Ko. They have not been supported yet but will be supported in ICU collator. Support U+0626 Arabic Letter Yeh with Hamza Above for Uyghur. Support U+0149 Latin Small Letter N Preceded By Apostrophe for Afrikaans. Support digraphs dd, ff, ng, ph, rh for Welsh. * sort.c: Add Latin Extended-F and -G. Add CJK Unified Ideograph Extension H, Cyrillic Extended-D, Devanagari Extended-A and Arabic Extended-C in Unicode 15.0. 2022-09-02 TANAKA Takuji * main.c: Moved calls of KP_entry_filetype() after checking options. This change prevents unwanted looking for texmf.cnf. https://github.com/texjporg/tex-jp-build/pull/144 2022-05-22 TANAKA Takuji * version 1.01 Stable version. * configure.ac: Bump version. * fread.c: Modify heuristic detection of page number types for ambiguous ones between roman number and alphabet. * tests/upmendex.test, tests/pprec[567].ist, tests/romalp[ABCD].idx, tests/romalpA-[56].ind, tests/romalp[BD]-[567].ind, tests/romalpC-5.ind, Makefile.am: Update tests. 2022-03-21 Karl Berry * TL'22 release. 2022-03-07 Hironobu Yamashita * fwrite.c: output to file instead of accumulating too much. https://github.com/texjporg/tex-jp-build/issues/137 2022-01-18 Karl Berry * COPYRIGHT, * main.c: 2022. 2021-11-20 TANAKA Takuji * version 1.00 Stable version. * configure.ac: Bump version. * {,ex}var.h, fwrite.c, styfile.c: Add options "script_preamble" and "script_postamble" in style file. * sort.c: Add Latin Extended-F and -G. Add U_FORMAT_CHAR to is_type_mark_or_punct(). * mendex.h, var.h, convert.c, fwrite.c, sort.c, styfile.c: Supports Arabic, Hebrew scripts (experimental). * upmendex.ja.txt: Update document. 2021-10-24 TANAKA Takuji * version 0.60 Beta version. * configure.ac: Bump version. * Makefile.am, convert.c, fwrite.c, sort.c, {,ex}kana.h, ktable.h, mendex.h: Enhance procedure of classifying characters: Newly classifies by character type of General Category in Unicode Script Property. https://github.com/t-tk/upmendex-package/issues/8 2021-09-19 TANAKA Takuji * fread.c: Modify heuristic detection of page number types. * tests/upmendex.test, tests/range.idx, tests/range[123].ind, tests/pprecB.idx, tests/pprec[34].ist, tests/pprecB-[034].ind: Update tests. 2021-09-18 TANAKA Takuji * fread.c, fwrite.c, sort.c, mendex.h, {,ex}var.h: Extend levels of page numbers from 3 to 10 for compatibility with makeindex. Fix a bug of page ranges with multi-level page numbers. * fread.c: Strict check for "page_precedence" and "page_compositor" in style file. * fwrite.c: Do not break lines before "delim_n". * tests/upmendex.test, tests/range.idx, tests/range[123].{ist,ind}, removed tests/rangetwo.*: Update tests. 2021-09-05 TANAKA Takuji * fwrite.c: Support I-Y ordering in Lithuanian. 2021-09-04 TANAKA Takuji * version 0.59 Beta version. * configure.ac: Bump version. * fwrite.c, styfile.c, {,ex}var.h: Support output order of symbols and numbers separately when "symbol_flag" is 2 in style file. * sort.c: Accept combination of "icu_locale" and "icu_rules" in style file. If simultaneously specified, collation rules specified by "icu_rules" are added on collation rules specified by "icu_locale". * tests/upmendex.test, tests/rule.{idx,ind}, tests/locale2.ist, tests/{locale1,attri1,locale2-rule}.ind: Add a new test and update conventional tests. 2021-08-29 TANAKA Takuji * styfile.c: Strict check for "icu_attributes" in style file. * sort.c: Strict check for "character_order" in style file. * fwrite.c: Fix a bug when "symbol_flag" is 0. * tests/upmendex.test, tests/attr1.{ist,ind}, tests/locale1.ind, tests/rule.{idx,ind}: Add a new test and update conventional tests. 2021-08-28 TANAKA Takuji * styfile.c: Add a alias "headings_flag" of the keyword "heading_flag" in style file for compatibility with makeindex. Warn if specifier in style file is unknown. 2021-08-26 TANAKA Takuji * sort.c: Initialize a variable. * tests/upmendex.test, tests/locale1.{ist,ind}: Add a new test for "icu_locale". 2021-08-25 TANAKA Takuji * sort.c: Initialize a string. Check malformed input for "icu_rules". * tests/upmendex.test, tests/rule[34].ist: Add new tests. 2021-08-23 TANAKA Takuji * tests/upmendex.test, tests/rule[12].ist, tests/rule.{idx,ind}: Add new tests for the keyword "icu_rules" in style file. 2021-08-08 TANAKA Takuji * version 0.58 Beta version. * configure.ac: Bump version. * main.c, convert.c, fwrite.c, sort.c, styfile.c, mendex.h, {,ex}kana.h, {,ex}var.h: Supports Devanagari, Thai scripts (experimental). Add keywords "devanagari_head", "thai_head" to style file. Add a keyword "hangul_head" to style file, let "tumumja" deprecated. * styfile.c: Add an option "numeric-ordering" to the keyword "icu_attributes" in style file. * upmendex.ja.txt: Update document. 2021-06-19 Karl Berry * configure.ac: pthread doc update; also needed in bibtexu (q.v.). 2021-06-19 TANAKA Takuji * configure.ac: variadic checks before AC_PROG_CXX, to avoid error of C++ compiler detection in some environments. Report from Vadim Kantorov, https://github.com/t-tk/upmendex-package/issues/1 * main.c: Add ICU version to banner. 2021-06-13 TANAKA Takuji Same fixes as for mendex. * fwrite.c: Avoid segmentation fault for unwritable fopen. * pageread.c, styfile.c: Exit code 255 for non-existent style/log file. 2021-06-06 TANAKA Takuji * version 0.57 Beta version. * configure.ac: Bump version. * main.c, styfile.c: Accept multiple numbers of style files by option '-s'. * main.c, sort.c, styfile.c, fwrite.c, {,ex}kana.h, {,ex}var.h: Add a keyword "kana_head" to style file. Treat "Circled Katakana" as kana. * tests/upmendex.test, tests/uni[12].ist, tests/rangetwo.{idx,ind,ist}: Add new tests. * upmendex.ja.txt: Update document. Ref. https://github.com/texjporg/tex-jp-build/issues/116 2021-05-19 TANAKA Takuji * version 0.56 Beta version. * configure.ac: Bump version. * styfile.c: Fix a bug of reading keywords "setpage_prefix", "setpage_suffix" in style file. More strictly check string length. 2021-05-09 Karl Berry * mendex.h (page.attr): use int, not char, since we do signed comparison and aarch64 apparently treats char as unsigned. (index.words): use unsigned char for consistency. Same fixes as for mendex; see mendexk/ChangeLog. Report from Johannes Hielscher, https://tug.org/pipermail/tlbuild/2021q1/004873.html and follow-up: https://tug.org/pipermail/tlbuild/2021q2/004911.html 2021-03-23 Karl Berry * TL'21. 2021-03-21 TANAKA Takuji * version 0.55 Beta version. * configure.ac: Bump version. * fread.c: Fix a bug about page_precedence in style. Thanks to Suzana Sagadin for the report. * tests/pprec[012].ist, tests/pprecA.idx, tests/pprecA-[012].ind, tests/upmendex.test, Makefile.am: Add new tests. 2021-02-17 Karl Berry * COPYRIGHT, * main.c: copyright 2021. 2020-05-01 TANAKA Takuji * main.c, version.h, Makefile.am: Add bug report addresses on command line help. * convert.c: Tune buffer sizes to avoid tool warnings. 2020-02-22 TANAKA Takuji * version 0.54 Beta version. * configure.ac, main.c, COPYRIGHT: Bump version and copyright year. * {convert,fwrite,sort}.c: Support Small Kana Extension in Unicode 12.0. Add CJK Unified Ideographs Extension G. Update samples. 2020-01-16 Karl Berry * configure.ac, * COPYRIGHT, * main.c (VERSION): version 0.53 for TL'20. 2019-12-12 Karl Berry * configure.ac: look for a pthread library cntaining pthread_create (using KPSE_SEARCH_LIBS) and set PTHREAD_LIB if found. * Makefile.am (LDADD): include $(PTHREAD_LIB). As of ICU 65.1, upmendex requires -lpthread. Otherwise, it crashes at sort.c:82, which merely creates the icu_collator: icu_collator = ucol_open(icu_locale, &status); About 10 stack frames deep into the ICU initialization, this winds up calling (Build/source/libs/icu/icu-src/source/common/umutex.cpp:146): std::call_once(*pInitFlag, umtx_init); which results in a fatal error from stdc++: terminate called after throwing an instance of 'std::system_error' what(): Unknown error -1 Web searching indicated that that opaque error message is a result of not linking with -lpthread. So now we do so. So far our other ICU programs (bibtexu, xetex) don't seem to require it. 2019-03-16 TANAKA Takuji * upmendex.ja.txt: Rename from upmendex.doc, minor update. 2019-03-06 Hironobu Yamashita Fix a bug in multibytelen(). 2019-02-17 TANAKA Takuji * version 0.52 Beta version. * main.c, COPYRIGHT: Update copyright year. * fread.c: Fix typo. It seems a bug. * {convert,fread,fwrite,kp,sort,styfile}.c, {hanzi,mendex}.h: Avoid compiler warnings. Thanks to @ngothan for a report by covscan. https://github.com/TeX-Live/texlive-source/issues/11 2017-04-05 TANAKA Takuji * main.c: Avoid access violation. Thanks to I. Wada san. * main.c, COPYRIGHT: Update copyright year. 2016-12-24 TANAKA Takuji * sort.c: Add Cyrillic Extended-C 2016-11-12 TANAKA Takuji * main.c: Avoid access violation. 2016-06-19 TANAKA Takuji * version 0.51 Beta version. * fwrite.c: Fix bug of option "hanzi_head" in style file. Thanks to Dr. Werner Fink. 2016-02-12 Karl Berry * configure.ac (KPSE_CXX_HACK): add to avoid usual libstdc++.so dynamic linking failure. 2016-02-09 Akira Kakuto * convert.c, fread.c, main.c: Remove unused local variables. * qsort.c: Avoid compiler warning in the case of Windows 64bit. 2016-01-31 TANAKA Takuji * version 0.50 Beta version. Support option "icu_rules" in style file. Support option "icu_attributes" in style file. Support option "priority" in style file as similar to mendex. Refine and fix several bugs in sorting procedure. Fix bug of -g option. Update samples. 2015-10-10 TANAKA Takuji * version 0.06 Alpha version. Support dotted/dotless I/i in Turkish. Support digraphs in Albanian. Modify on surrogate pair. Update samples. 2015-09-27 TANAKA Takuji * version 0.05 Alpha version. Support Hanzi ideograph (Han script) indexing. It works when icu_locale is "zh@collation=unihan", "ja@collation=unihan", "ko@collation=unihan" (radical-stroke), "zh@collation=stroke" (stroke), "zh" (pinyin) or "zh@collation=zhuyin" (zhuyin). Fix bug on surrogate pair. Update samples. 2015-08-23 TANAKA Takuji * version 0.04 Alpha version. Support digraphs and trigraphs in Latin scripts. Accept long file name for a log file and a index file. Update samples. 2015-08-08 TANAKA Takuji * version 0.03 Alpha version. Support option "icu_locale" in style file. Support diacritical marks in Latin, Greek, Cyrillic. Update samples. 2015-05-17 TANAKA Takuji * version 0.02 Alpha version. Support Korean Hangul. Fix bug on Japanese indexing. Fix bug on surrogate pair. Add option "icu_locale" in style file (Preliminary). 2015-01-24 TANAKA Takuji * Initial version 0.01. Alpha version based on mendex version 2.6f [14-Aug-2009] (TeX Live 2015/dev).