-*- mode: org; -*- Changes to Chatter * 1.0-alpha-111 (February 1, 2012) ** XML Schema updated to 1.9.7 Added "real" attribute to replacement element, to indicate whether the annotated word was real. So CHAT now has the [:: replacement words] annotation as distinct from [: replacement words] * 1.0-alpha-110 (January 30, 2012) Fix bug in XML generation when both %mor with translation and %gra exist. * 1.0-alpha-109 (January 5, 2012) Fix regression bug in handling form markers that broke @n. * 1.0-alpha-108 (January 3, 2012) ** XML Schema updated to 1.9.6 Change to schema to allow "-s" and other suffixes after a form marker. Specifically for a@l-s ** %com and other dependent tiers now allow all IPA characters * 1.0-alpha-107 (December 31, 2011) ** XML Schema updated to 1.9.5 CHAT breathy voice character changed Changed from 03BE to 264B. 03BE is now treated as the Greek character it really is. * 1.0-alpha-106 (December 20, 2011) ** XML Schema updated to 1.9.3 mor translation is now supported for preclitic and postclitic also. * 1.0-alpha-105 (October 5, 2011) ** XML Schema updated to 1.9.3 *** separated-prefix Support # for word to be treated as prefix for Hebrew: new word attribute "separated-prefix". *** legacy Remove the "legacy" option, finally. * 1.0-alpha-104 (September 30, 2011) ** CHAT form marker misparsed Fix form marker lexing so that invalid form markers are detected rather than misparsed. * 1.0-alpha-103 (September 29, 2011) ** XML Schema updated to 1.9.2 *** New stress characters 02C8 for primary stress 02CC for secondary stress *** Support for new dependent tier %sin There are new grouping brackets 3014 and 3015 for the main and %sin tiers. * 1.0-alpha-102 (September 20, 2011) ** Fix mor 1-1 bug "&frag [: replaced]" was not being handled properly with %mor line. ** Internal Update to ANTLR 3.4 * 1.0-alpha-101 (July 12, 2011) ** XML Schema updated to 1.9.1 *** CA Added "Breathy voice" CA character 03BE. ** Other CHAT changes *** User-defined dependent tiers Restricted total number of characters to 7 (including the initial "x"). ** Internal updates Updated to ANTLR 3.4b4, JAXB 2.2.4. * 1.0-alpha-100 (June 23, 2011) ** XML Schema updated to 1.9.0 Added word attribute user-special-form. ** CHAT Now allow word@z:code * 1.0-alpha-99 (June 16, 2011) ** Phon compatibility Updated pho allowed characters from features.xml from Phon. ** CHAT Now allow + in speaker ID. * 1.0-alpha-98 (May 23, 2011) ** XML Schema updated to 1.8.3 Added participant role CallTaker. Added "anonymized" to Transcription. * 1.0-alpha-97 (May 18, 2011) ** XML Schema updated to 1.8.2 Avoid network access (since sometimes unreliable) of xml.xsd by specifying a catalog file src/main/resources/catalog.cat mapping "http://www.w3.org/2001/xml.xsd" to "xml.xsd" * 1.0-alpha-96 (May 2, 2011) ** CHAT *** Languages Added support for more Chinese languages: nan, min, yue, and zho* ** xml2chat Check that the date att, if not default, has a matching @Date header. ** Scripts as filters Allow Chatter to be run as a filter from standard input to standard output when run from the command line. For example: $ java org.talkbank.chatter.App -inputFormat cha -outputFormat xml < input.cha > output.xml * 1.0-alpha-95 (April 8, 2011) ** CHAT Do not allow initial spaces on header or tier lines. ** Intel Leopard support Rebuilt the CLAN-communicating native code against old 10.5 SDK instead of 10.6 SDK. * 1.0-alpha-94 (February 21, 2011) ** CHAT @Number header Changed @Number header info to require arabic numerals for 1 through 5. * 1.0-alpha-93 (February 13, 2011) ** pho error message fixed pho error messages involving words with more than one letter are now giving correct location information. * 1.0-alpha-92 (January 14, 2011) ** XML Schema updated to 1.8.1 (changes to CA) *** CA symbol 224B renamed from "TCU continuation" to "technical break TCU continuation" *** CA symbol 2248 renamed from "no break continuation" to "no break TCU continuation" *** CA symbol 2261 renamed from "latching" to "uptake" * 1.0-alpha-91 (December 22, 2010) ** XML Schema updated to 1.8.0 (major change) and now have element content inside elements instead of being in , and contains and and Changed to jaxb:version="2.1" and added explicit "simple" binding mode directive. Made sure that has default namespace as XHTML. ** Fixed language marker regression bug @s with no arguments was not being parsed correctly, thanks to bug introduced in alpha-90. * 1.0-alpha-90 (December 13, 2010) ** Fixed packaging Packaging failed to pick up new JAXB-generated classes properly, so xml2chat was failing. ** Upgraded to JAXB 2.2.3 ** Error handling for @s in CHAT More graceful. * 1.0-alpha-89 (December 9, 2010) ** XML Schema updated to 1.7.1 Allow to have and in any order. * 1.0-alpha-88 (December 9, 2010) A major change to the XML Schema for Phon. ** XML Schema updated to 1.7.0 (major change) *** Phon has been split to and , and value is now element rather than attribute. Other Phon-relevant changes. *** role Participant role is now required, and restricted. ** Bullet checking If @Media is "unlinked", then bullets are not allowed. ** Java version information "About" box now includes Java version information. ** xml2chat Phon and are now allowed, but signal a warning that they are not yet supported for conversion to CHAT. ** Implementation upgrades *** Upgraded ANTLR version to 3.3 * 1.0-alpha-87 (October 29, 2010) ** CHAT Added @T for thumbnail header. ** XML Schema updated to 1.6.2 Added thumbnail header. * 1.0-alpha-86 (October 13, 2010) ** CHAT Allow words such as "d'Eoin", where the rule is that a capital letter is allowed after a letter-apostrophe combination. ** Internal development Now using Maven 3.0 for builds. * 1.0-alpha-85 (October 5, 2010) Fixed a regression bug resulting in empty CHAT output. * 1.0-alpha-84 (September 15, 2010) This is a substantial release, with many fixes and changes. XML will need to be regenerated. ** GUI application changes *** Output validation Changed default to perform output XML validation instead of omitting it. There are plans for other output validation as well. *** Input validation When unmarshalling input XML, validation is now turned on. ** CHAT changes *** %xpho, %xmod Now allow the bracket symbols 2039 and 203A, as hacks to support temporary change of %pho and %mod to %xpho and %xmod to turn off all parsing. ** XML Schema updated to 1.6.1 *** xs:float replaced with xs:decimal For start, end times and pause length. *** Fixed key and keyref constraints on CHAT Full namespaces were put in to properly create XPath addresses. Utterance id was changed from "key" to "unique". *** Nonempty strings Various attributes have been changed from xs:string to nonemptyStringType. For example, speaker role. *** Changed Media type from xs:anyURI Set up allowed media reference pattern. *** Other type changes Other miscellaneous type changes. * 1.0-alpha-83 (August 30, 2010) ** Renamed chat2xml to chatter This affects all distributed application and JAR names and scripts. Also, internal Java package and class names were changed as well. * 1.0-alpha-82 (August 17, 2010) ** Cleaned up allowable IPA characters *** On %pho tier We now use http://www.acssoft.de/world-keyboard.com/ipatables.html as the definition of allowable IPA. *** UNIBET on main line Also uses the IPA tables. * 1.0-alpha-81 (August 10, 2010) ** %pho/%xpho characters *** Updated from Phon features.xml *** More IPA Unicode allowed Found an extra source of IPA tables to extra legal Unicode characters from, including 203F. * 1.0-alpha-80 (August 4, 2010) ** XML schema updated to 1.6.0 Added "bullets" option. ** CHAT features Added "bullets" option to turn off bullet time sequence checking. ** TRN translator complete For Santa Barbara corpus. Error messages are prefixed with "0" for communication with CLAN editor. * 1.0-alpha-79 (July 19, 2010) ** XML Schema updated to 1.5.9 element has been disambiguated into (after utterance terminator, or in dependent tier) and (inside utterance before terminator). ** Serializing to CHAT *** Part of speech in word Fixed roundtrip of word$pos. *** Bullets Internal bullets are now generated with continuation line after them in CHAT. * 1.0-alpha-78 (July 14, 2010) ** Language lexing Allow zho-min. * 1.0-alpha-77 (June 27, 2010) ** XML Schema updated to 1.5.8 *** Enforced three letter language codes *** More Schema documentation added to link to chat.html ** Added more supported languages Various languages zho-* are now recognized along with zho. ** CHAT Updated allowed "@Interaction Type" and "@Number" options. * 1.0-alpha-76 (May 20, 2010) ** XML Schema updated to 1.5.7 *** Changed more documentation *** Removed form marker @m ** Removed infinite loop bug An infinite loop bug resulted from the removal of "/", "//", and "///" prosody support inside words in 1.0-alpha-75 because the "/" character was not being recognized and was being retried rather than rejected. * 1.0-alpha-75 (May 19, 2010) ** XML Schema updated to 1.5.6 Big cleanup of XML schema, removing unused material, consolidating others. First step at cleaning up documentation, inserting references to the CHAT manual, which is now online as HTML generated from the master Word document and placed along side the PDF version. The HTML version of the manual is now available as http://childes.psy.cmu.edu/manuals/chat.html. The schema is now processed by xsddoc from Filigris (http://www.filigris.com/products/docflex_xml/xsddoc/) to generate HTML documentation now available at http://talkbank.org/software/xsddoc/index.html *** Restored @Page The @Page header was removed in 1.0-alpha-37, but has been restored. *** Removed some dependent tiers %pht %ssy %spe %ton %rom *** Removed %mor error markers *** Removed 0* and 00 prefixes for words on main line *** Removed various form markers @ap @pr @sc *** Removed some old prosodic markers / // /// *** Many other miscellaneous removals ** CHAT language changes Changed two-letter language codes to three-letter codes in entire database, and no longer support the two-letter codes where the three-letter codes exist (ISO 639). *** Removed special lexical support zh-min zh-guoyu zh-yue vi *** Changed from two letters for special lexical support zho tha cym deu heb * 1.0-alpha-74 (March 30, 2010) ** GUI progress bar A bug in the progress bar has been fixed, in which "redo" resulted in the progress bar not being reinitialized back to 0. ** Duplicate dependent tiers Duplicate dependent tiers for a single utterance are now detected and flagged as erroneous. * 1.0-alpha-73 (March 25, 2010) ** %gra and %grt are now fully supported Dependent tiers that were temporarily named %xgra and %xgrt are now fully supported. * 1.0-alpha-72 (March 16, 2010) Radical changes to structure. ** CHAT language change Allow part of speech at the end of a word on the main line for optional part of speech (for use by mor). The syntax is $category or $category:subcategory or $category:subcategory:subcategory (multiple subcategories allowed) ** XML schema updated to 1.5.5 Completely removed . The main space-delimited group is now just under . Changed to remove and . Moved into an attribute of , since it is supposed to apply at that level. Preclitics and postclitics are now and children of . Mor translations have been moved out of to . may now also be present in (see CHAT language change). ** Bug fix in %mor omission In %mor, 0cat... had always been parsed incorrectly and silently, generating incorrect XML, but now is parsed correctly. ** GUI improvement The Open dialog now selects by default the directory last opened (whether in the current session or saved from the last launch of the application). * 1.0-alpha-71 (March 9, 2010) Radical changes in schema ** CHAT language change In addition to formerly supported @s and @s:otherlang, we now also allow @s:lang1&lang2 for a word that could be interpreted as either in lang1 or lang2 @s:lang1+lang2 for a word whose stem is in lang1 and suffix is in lang2 In either case, also more than two languages can be specified. ** XML schema updated to 1.5.4 no longer uses the xml:lang attribute for language, because we have introduced the possibility of multiple languages per word. A new element has been created, which has a child of , , or . * 1.0-alpha-70 (March 4, 2010) ** XML schema updated to 1.5.3 Added mor replacement (@replacement in %mor/%trn line). ** CHAT New feature: mor replacement, which occurs after any optional translations. Example: %mor: part|kick-PROG@pushing . * 1.0-alpha-69 (March 1, 2010) ** Empty %mor line 1.0-alpha-68 removed support for an empty %mor line and crashed upon encountering it. This has been fixed. * 1.0-alpha-68 (March 1, 2010) Major changes in schema affecting all transcripts with %mor. ** XML schema updated to 1.5.2 Major change: terminators now can take children. The reason for this change is to start including the %mor terminator explicitly inside inside . Introduced for "mor terminator". Various other schema cleanups, including putting back CA "unmarked ending" that was removed in 1.0-alpha-33. ** CHAT *** CA "unmarked ending" supported again *** %pho characters updated Regenerated legal %pho characters based on phon's features.xml. * 1.0-alpha-67 (February 9, 2010) ** Added %pho and %mod character Unicode 2026 is now supported. * 1.0-alpha-66 (February 5, 2010) ** Updated Phon features for %pho and %mod Set of Phon features had not been regenerated since 5-22-08. Now updated based on latest features.xml in Phon. ** Error message for length mismatches in %pho and %mod, and %mor and %trn An omission was made in the fix from 1.0-alpha-65, and has been fixed. An analogous set of fixes were made for %mor/%trn error messages. ** Underscore in %xpho, %xmod Allow underscore "_" as a character in %xpho and %xmod. * 1.0-alpha-65 (February 4, 2010) ** Mac app universal support fixes Various cleanups were made so that on the Mac, 64-bit or 32-bit mode is used as appropriate, and Java 5 or 6 (or higher). LSArchitecturePriority is now used instead of the obsolete JVMArchs. This fixes various problems in running chat2xml on old Intel Macs, as well as running chat2xml as a jar. Added -x86_64 to the Universal binary for send2clan. ** tmplib cleaned up Launching CLAN from the GUI error message window resulted in a "tmplib" directory being created and left behind in the directory containing the chat2xml application. The behavior has been changed to use instead the temporary directory given by the java.io.tmpdir property, if the user has not already set the java.library.tmpdir that mx-native-loader wants. ** Windows bug fix An important bug was fixed that caused chat2xml to fail to delete empty XML files upoon error. This bug also had the side effect of causing the "Redo" button to be useless, forcing users to manually remove the 0-sized XML files. ** Error message for %pho and %mod Now that %mod is supported in the same way as %pho, an error on it is noted as an error in %mod rather than in %pho. ** Memory of last directory processed The last directory processed is now remembered across sessions, so that a fresh launch of chat2xml will prefill the text field containing the path of the last directory in the previous session. Note: for some reason, the Mac OS app version of chat2xml is not yet saving the preference. This is being investigated. The Mac OS "Open" dialog box, however, has always saved the position of the last directory opened. * 1.0-alpha-64 (February 2, 2010) Important: CHAT syntax now has changed, so all transcripts that used %pho, %mod, or %npho must be rechecked and modified as appropriate. ** XML schema to version 1.5.1 Removed legacy %pho and %mod support. For the first time, transcripts with both %pho (formerly %npho) and %mod are correctly represented in the XML. ** CHAT Removed legacy %pho and %mod dependent tiers. %pho and %mod are now fully parsed, so legacy %pho and %mod must be renamed to %xpho and %xmod, and %npho renamed to %pho. ** Java version Removed the unnecessary restriction (introduced in 1.0-alpha-61) of running on Java 6, so now Java 5 can again be used. ** Date formatting Date formatting when going from CHAT to XML or vice versa was failing unexpectedly in France, etc., because of locale settings. chat2xml now explicitly uses English regardless of system locale setting. * 1.0-alpha-63 (January 29, 2010) xml2chat has been completed! ** GUI User interface now allows choice of input format (CHAT or XML) and output format (CHAT or XML). There are therefore four possible input-to-output conversions. Note that CHAT to CHAT or XML to XML are roundtrips going through the ANTLR tree as an intermediate internal format. ** Performance improvements A memory leak in the use of StringTemplate for generating output has been removed. * 1.0-alpha-62 (January 25, 2010) Radical changes to the XML schema. ** Schema updated to 1.5.0 *** Cleaned up form markers Removed element altogether. Shifted type attribute to "ftype" attribute in . Removed hack, turned it into character content in . *** Text before form marker is now treated as a word Previously, there were restrictions on the text before a form marker. Now, the whole unit is considered a first-class . ** CHAT changes "a@l-s" is no longer supported. The plural of "a@l" is simply "a-s". ** Progress bar or text Fixed a bug in which int overflow occurred if the amount of data being processed exceeded int range. Now we use long instead. * 1.0-alpha-61 (December 15, 2009) ** Schema updated to 1.4.9 Ongoing cleanup. Fixed eventType definition; created action and happening elements. Note: because of XML changes, old CHAT files will have to be reconverted to XML. ** Allowed CUJK symbols, punctuation in %ort ** Upgrades to libraries Upgraded to JAXB 2.2, etc. ** Java 1.6 Moved from Java 1.5 to 1.6. * 1.0-alpha-60 (November 19, 2009) ** Schema updated to 1.4.8 *** Allow tag/vocative in %mor ** Allow tag/vocative in unparsed dependent tiers * 1.0-alpha-59 (November 18, 2009) ** Schema updated to 1.4.7 Changed tagMarkerType to allow two kinds, tag and vocative. Corrected legacy hack of treating ",," as separator. %mor 1-1 is changed accordingly. ** CHAT extended with tag and vocative characters 201E is tag, 2021 is vocative ** Internals Reverted back to using CommonTreeNodeStream, and used dupTree to resolve old parser tree problems. Cleaned up code to take advantage of upgrade from ANTLR 3.1.1 to 3.2. Noticed that {terminatorIsOptional} was incorrectly not used as a gated semantic predicate. * 1.0-alpha-58 (November 5, 2009) ** Upgraded to ANTLR 3.2 From ANTLR 3.1.1 (ancient) to 3.2 antlr3-maven-plugin to 3.2 StringTemplate to 3.2.1 ** Capital letters *** Welsh Allowed capital letter after ' * 1.0-alpha-57 (October 31, 2009) ** Capital letters checking for Hebrew Now allow ~ and ' before a capital letter in the middle of a word. * 1.0-alpha-56 (October 30, 2009) ** Enforce CHECK's restrictions on capital letters Capital letters in words are now allowed only in the following cases. *** initial letter in a word *** after _ (treated as word boundary) *** after - (treated as word boundary) *** after + (treated as word boundary) if German (de) *** if @Options includes "caps" *** after a prefix (from last word boundary) listed in CLAN's depfile.cut "Mac", "Mc", "De", "La", "Las", "Los", "San", "St", "D'", "O'" *** word matches one of the patterns listed in @Exceptions Patterns can use any combination of literals and globs "*" which stand for any (optionally empty) sequence of literals, e.g., "anException" or "*N*". *** Fixed bug in handling words Underscore was not being treated as part of the stripped-down version of a word for END_WORD internal token. ** Miscellaneous *** Allow Welsh (cy) to have words with numbers A hack. *** Allow Hebrew (he) to have a capital letter after a glottal (Unicode 0294) * 1.0-alpha-55 (October 29, 2009) ** XML schema updated to 1.4.6 *** New @Media option: unlinked "unlinked" will mean that a transcript has media associated with it but has not yet been linked to it. *** Option "caps" supported "caps" was inadvertently not added to the schema in the last chat2xml release. ** New @Participants role: Partner * 1.0-alpha-54 (October 28, 2009) ** Internal ANTLR AST change Compound words now have END_WORD with the separator included in the text. ** Bullet time start checking fixed Bullets on dependent tiers should not be involved in checking time start sequence. ** Compound words with form markers Form markers @c and @n and @sl now are allowed for compounds. ** Letters @l can only be used with a single letter. ** New @Options: caps "caps" is now used to indicate that capital letters may be allowed in a word. Without "caps" or "sign", capital letters are no longer allowed. Note: capital letter checking to be compatible with CHECK is currently completely turned off, awaiting further testing and modification. * 1.0-alpha-53 (October 25, 2009) ** Schema updated to 1.4.5 Removed support for annotations to pauses. Removed %: dependent tier. Removed +"" (quotation assigned to other speaker). Removed form markers @cue, @inf, @ins. ** Changed misleading %mor/%trn/%npho 1-1 error messages The error messages were changed to simply inform of a mismatch in lengths of tiers. ** Bullet checking added Bullets are now checked for self-validity (start time earlier than end time). Bullets are now checked for consistency in the transcript (start time of a particular bullet must be later than the start time of the previous bullet) ** Comma-space Require space after comma in @Languages, @Participants, @Options, @Media * 1.0-alpha-52 (October 17, 2009) ** Participant roles updated Added some more roles, and in sync now with CLAN's depfiles.cut * 1.0-alpha-51 (October 16, 2009) ** Transcript must end right after the first @End Previously, chat2xml erroneously simply ignored all text after the first @End ** Enforced CLAN's depfile.cut restriction on participant role ** Relaxed restriction on participant name ** Every participant must now have an @ID line * 1.0-alpha-50 (September 24, 2009) Corrected lexing of special characters that were being erroneously treated as regular word letter characters: "laugh in word" and "yawn". Old XML files that are bad will need to be regenerated, or hand-edited to change the Unicode of "laugh in word" to and "yawn" delimited material must be changed to stuff * 1.0-alpha-49 (August 27, 2009) Corrected internal error upon parsing a nonempty "education" field of @ID. * 1.0-alpha-48 (August 26, 2009) ** chat2xml Had forgotten to remove parsing support for removed inlined dependent tiers add, int, sit, gpx, spe. * 1.0-alpha-47 (August 26, 2009) ** Radical changes in XML schema, to version 1.4.4 Changed schema so that is only used for utterance level annotations, not for group level; new element for group level annotation (including scoped symbols). *** Added groupAnnotationType, groupAnnotationTypeType. *** Removed some inlined dependent tiers add, int, sit, gpx, spe ** chat2xml syntax change in @Media For consistency, a comma is now required to separate the fields, e.g., @Media: filename video missing must now be @Media: filename, video, missing * 1.0-alpha-46 (August 1, 2009) ** @L1 of Removed incorrect restriction of language specified (alpha 45 required it to be listed in @Languages) * 1.0-alpha-45 (July 31, 2009) Schema and chat2xml changes. XML schema updated to version 1.4.3. ** Schema 1.4.3 Added attributes "birthplace" and "first-language". Removed obsolete phoKey key and keyref. Changed speaker ids from ID and IDREF to just string. ** New CHAT headers @Birthplace of @L1 of ** Phon "+" (7-28-09) Supported in %npho line, passed into pho "value" attribute for now. ** Speaker IDs may now start with numeral For example, "123" is now legal. * 1.0-alpha-44 (July 20, 2009) A radical change was made to the parsing of and the XML representation of phonological fragments ("&frag"), to treat the "frag" part as fully equivalent to an ordinary word, and therefore having all the associated substructure. All old XML documents using phonological fragments need to be converted accordingly. ** XML schema updated to version 1.4.2 *** Has been put back into chat2xml and the schema, for the sake of Heritage. *** Phonological fragments are now with special characters now parsed into substructure and obeying the constraints of the current language. *** Now allowed in dependent tiers. * 1.0-alpha-43 (June 16, 2009) Retraces now work in all contexts properly with pho 1-1. * 1.0-alpha-42 (June 15, 2009) ** CHAT changes for pho Groups are now fully supported on the main line. * 1.0-alpha-41 (June 15, 2009) ** CHAT changes for pho Added pho group begin (Unicode 2039) and end (Unicode 203A) for main line and the %npho line. Removed use of # from the %npho line, replacing with the new group begin/end instead. Groups are not fully implemented yet in the main line, but will be soon. * 1.0-alpha-40 (June 12, 2009) ** Major changes to %npho parsing *** %npho now uses space to separate words *** %npho now uses # to separate groups Groups are not yet supported in the CHAT main line, but will be eventually. Right now, each individual word on the main line is treated as a group containing a single word. ** Changes to allowable %npho phones Was allowing A-Z before, but now restricting to C, G, V, X, S * 1.0-alpha-39 (June 11, 2009) ** Schema updated to 1.4.1 *** removed , *** rid "line" option *** Various cleanups, refactoring ** $ in %npho We are now temporarily (?) allowing $ to appear in %npho. * 1.0-alpha-38 (June 1, 2009) Note: this version breaks compatibility with XML files that have compound words in %mor. ** Schema updated to 1.4.0 *** Created morphemicCompoundWordType *** Allowed prefixes for a compound word * 1.0-alpha-37 (May 29, 2009) This is a major release altering the XML schema, updating it to version 1.3.9. Old XML files will no longer be valid if they used certain headers or attributes, or more than one language. ** Header change of @Coder @Coder has been replaced with @Transcriber and is no longer supported. ** Removed "@Language of" header ** Added headers to schema @Interaction Type @Number @Recording Quality @Transcription @Blank ** Removed unused headers from schema @Media (we have been putting into attribute instead) @Education @Code @Page @Script @Keywords ** Removed unused CHAT attributes Coder Duration Situation Warning ** Cleaned up various language attributes Renamed languageType to languagesType and made it no longer a string, but a list of xs:language, hence fully checked, and space-separated rather than comma-separated * 1.0-alpha-36 (May 11, 2009) ** XML output Fixed a localization problem in generation of floats for media bullets. Floats are now generated always with "." and never with ",". * 1.0-alpha-35 (May 5, 2009) ** Schema changes to 1.3.8 *** Cleanup Made media unit required. Removed unused enumerations. Unified various internal element definitions of the same names, moving them out and using refs. *** Options attribute Added "sign". *** CA Changed double-wavy latching (subword element) 2248 to 2261 Change the function of the current double-wavy 2248 to a terminator ("no break continuation") like the already existing triple-wavy 224B. Added corresponding double-wavy continuer (with +). ** Roundtrip Roundtrip is still being completed. Now using JAXB 2.1.11. * 1.0-alpha-34 (April 16, 2009) ** Fixed CA mode handling of missing terminator In CA mode, a missing terminator should be OK, even without a bullet following it. ** Fixed generation of multiple options into XML Now space-delimited as required by schema. * 1.0-alpha-33 (April 16, 2009) ** Fixed regression bug introduced in alpha-25 Quotations ["] now work properly. ** CHAT, XML schema change XML schema updated to version 1.3.7. *** Options Added "CA" back to options. This enables terminators to be optional in utterances. Added "line" to options. This is purely for CLAN display of bullets, just as "multi" was, and has no semantic significance. *** CA separators, terminators Removed CA separators added in alpha 30. Changed 5 arrow CA terminators into CA prosodies. Removed continuing terminators introducted in alpha 31. Removed unmarked ending. *** TCU pause/bullet Removed TCU pause and bullet; these concepts are now folded into the utterance. Old transcripts must be changed to put any terminator after the TCU pause, if there is one. * 1.0-alpha-32 (April 8, 2009) ** CHAT, XML schema change XML schema updated to version 1.3.6. Added otherSpokenEvent. New CHAT syntax is &*WHO:word * 1.0-alpha-31 (April 5, 2009) ** CHAT, XML schema change XML schema updated to version 1.3.5. Added continuing terminators parallel to the six special CA terminators. * 1.0-alpha-30 (April 3, 2009) ** CHAT, XML schema change XML schema updated to version 1.3.4. Added separators parallel to the five special CA terminators. * 1.0-alpha-29 (March 31, 2009) ** Subword handling Allow a single hyphen to be a subword. * 1.0-alpha-28 (March 31, 2009) ** CHAT, XML schema change Added TCU continuation linker. XML schema updated to version 1.3.3. ** Subword handling Now allow hyphens to begin subwords. * 1.0-alpha-27 (March 26, 2009) ** Library upgrades Upgraded to JFlex 1.4.3 ** XML generation More "@Options: heritage" fixes. * 1.0-alpha-26 (March 23, 2009) This is an important release fixing bugs in XML generation. ** XML generation *** Multiple @Date headers were getting overwritten by the first Now all the information is preserved properly. *** Fixed generation of retraces (regression bug) *** Heritage control characters were corrupted Fixed, and also changed and improved Heritage generation in any case. * 1.0-alpha-25 (March 16, 2009) ** Fixed %mor handling A main line word with both an explanation and a retrace was incorrectly treated as non-ignored. * 1.0-alpha-24 (March 11, 2009) This is a major release, because of an incompatible CHAT format change. chat2xml will no longer accept CHAT files in the old media bullet format. File names no longer go into each bullet, because now, only one media file is allowed per CHAT file. Now there is a single @Media header that lists the media file for the transcript. Any transcripts that reference more than one media file will have to be split up, or the media files joined into one. ** Major CHAT format change: media bullets *** @Media header after @ID (and optional @Birth of and @Lang of) headers The allowed syntax: @Media: name audio @Media: name video @Media: name audio missing @Media: name video missing *** Audio/video bullets no longer have a file name or type attached to them ** Huge number of XML schema changes, to version 1.3.2 *** Dependent tiers **** %cod is now "coding" **** Dependent tiers %pse, %fan, %wor have been removed **** Dependent tiers %x... now have "flavor" attribute with "x" removed **** %mor, %trn can now include an optional because of "tag question" separator. *** Media **** element has been stripped of "href" and "type" attributes **** New element introduced **** "Mediatypes" attribute introduced, for "video", "audio", "missing" *** Miscellaneous cleanups **** "Options" attribute is now space-delimited and checked **** Other schema cleanups * 1.0-alpha-23 (February 12, 2009) ** Fixed critical regression bug in XML generation Scoped annotations were not being generated into the XML. ** Fixed ampersands in happenings, e.g., &=A&P Was being incorrectly parsed as &=A &P ** Added check to enforce @ID for each participant * 1.0-alpha-22 (January 19, 2009) ** fixed bugs resulting in internal errors during XML validation ** updated schema to 1.3.1 for cleanup ** check whether a participant exists when id encountered in utterance ** fixed regression bug existing good XML files were being deleted in some cases if no regeneration was necessary * 1.0-alpha-21 (January 16, 2009) ** Upgraded to ANTLR 3.1.1 ** schema is now at version 1.3.0 ** introduced allowing -s -es etc. ** XML validation can now be turned on with -validate on the command line Still have not provided a "preferences" option for the GUI, however ** Added -outputFormat for the command line (can either be "cha" or "xml") "cha" is still not ready to be used * 1.0-alpha-20 (December 8, 2008) ** New duration annotation for groups Using the same format as for pauses, e.g., *CHI: [# 3.] . *CHI: [# 2:3.] . *CHI: [# 2:3.4] . *CHI: [# 3.4] . * 1.0-alpha-19 (December 5, 2008) ** Precodes removed Precodes have been removed from the schema; any use of precodes should be changed to freecodes instead ** CA TCU continuation has been fixed to become a CA terminator rather than a CA element * 1.0-alpha-18 (November 24, 2008) ** Platform-independent jar chat2xml.jar is now provided, which works for Windows, Unix, etc. Double-clicking on error messages works for Windows to pop up CLAN. ** %co1 through %co9 %co1 through %co9 have been removed ** XML *** The CHAT attribute Mode has been removed *** The Media attribute has been added This corresponds to CHAT @Media header. * Earlier versions (omitted)