Link Grammar Parser Older Changes
Summary of Older Changes
Version 4.5.10 (25 August 2009)
- Be sure to link with -lm
Version 4.5.9 (25 August 2009)
- Modify error messages to indicate that they are from link-grammar.
- Add missing Java files that were forgotten last time around.
- Add greeting to command-line client startup.
- Print disjunct cost also, when requesting disjunct printing.
- Add missing color names as mass nouns.
- Fix: Reclassify musical instruments: "He plays piano"
- Add experimental word-clustering system.
- Add CMake build file
- Fix: "It takes longer than that."
- Fix: "He has done very well."
- Fix: a dozen optionally transitive verbs (swim, kill, etc.)
- Fix: "He's out running."
- Fix: "suddenly" is a "manner adverb", not a clausal adverb.
- Fix: Use Pg links to gerunds: "He feared hitting the wall."
- Fix: assorted numerical-range bugs.
- Fix: prep modifiers with distances: "It is a few miles out"
- Fix: Spelled-out dates: "It started in nineteen twelve"
- Fix: Misc date, time expression parsing e.g "Zero hour is here."
- Fix: Misc words, "ordered list", "screened out"
- Fix: Post-fixed numbers can act as determiners.
- Fix: "We bought the last 50 ft. of cable."
- Fix: opening directives to imperatives: "Finally, move it back."
- Fix: Improved simple equation parsing support.
- Fix: Add misc fixes from BioLG that were previously overlooked.
- Fix: "favorite" can take determiner "a" ("a favorite place")
- Fix: assorted clausal complements: "The emperor ordered it done."
- Fix: ordinals: "First on our list is ..."
- Fix: verb modifier "some of the time", "most places"
- Fix: Sit, stand take modifiers: "he stood still"
Version 4.5.8 (2 July 2009) includes the following changes:
- Fix: 'than anticipated', 'than was anticipated', etc.
- Fix: 'saw the wood'
- Fix: sometimes commas are used as if they were semicolons.
- Fix: 'We have quite enough work already, thank you!'
- Fix: allow 'and' as conjunction in entity names.
- Fix: 'I stared him down', 'They shouted him down', 'booed off'
- Fix: 'sound him out', 'look him over'
- Fix: 'Somewhere in the distance'
- Stub out list of names given to both men and women, to avoid duplication.
- Fix: 'I think so, too'
- Fixes for compilation under Cygwin.
- From Boris Iordanov: fixes to JSON java code.
- From Boris Iordanov: new java remote client code.
- Fix: Biological texts commonly have adj-noun-adj-noun chains
Version 4.5.7 (4 June 2009) includes the following changes:
- Fix 'make install' for windows (abi bug 12049)
- Fix multi-threaded bug when TRACK_SPACE_USAGE is defined.
- Add './configure --enable-mudflap' just for fun...
- Fix: "Walk tall", "Think quick"
- Fix: "... part no. 1234-56A"
- Fix regression from BioLG merge: "It cost $14 million."
- Fix come/came: "The dog came running..."
- Fix year abbreviations: "He drove a souped-up '57 Chevy"
- Fix sit, stand: "The dog stood still"
- Fix act up, act out: "He is acting out." "The motor is acting up."
- Fix notoriously, poorly: "The store was poorly stocked".
- Fix: "strong" can be adverb
- Add support for recognizing basic time zones during parsing.
- Fix: verbs acting as adjectival modifiers: "a very politicized deal."
- Fix: ".. nearly so well", "...almost so well".
- Fix financial ranges: "It will cost $10 million to $20 million to build."
- Expand handling of capitalized words that appear in entity names.
- Expand the list of characters that are recognized as quotes.
- Support usage of yes, no as sentence openers.
- Better support for directives, commands.
- Fix: "Ash Wednesday", "Fat Tuesday", etc.
- Fix: post-verbal adj: "she wiped the table dry"
- Fix: wish: "she wished me a happy birthday"
Version 4.5.6 (24 May 2009) includes the following changes:
- Bugfix: fix non-thread-safe usage.
- Changes to enable MinGW/Windows to compile.
- Update of MSVC6 build files
- Fix: pizza, fries, chopsticks.
- Export word-sense database to Java apps.
- Fix: "Was the man drunk or crazy or both?"
Version 4.5.5 (10 May 2009) includes the following changes:
- Bugfix: crash for zero-length sentences.
Version 4.5.4 (9 May 2009) includes the following changes:
- Fix: "sleep in": "A bed is something you sleep in."
- Fix: "drinking": "Let's go drinking."
- Fix go+bare infinitive: "Let's go shop", "Let's go swim"
- Fix: "Let's go for a swim." "Let's go for a smoke".
- Fix: "Let's not" "Let's not go" "Let's not cry"
- Fix: ... is
: "All he ever does is complain." - Fix: "You will die young/happy/unhappy"
- Fix: "You should exercise to stay fit."
- Fix: "We danced 'til dawn."
- Fix: "tell
off": "She had told him off." - Bugfix: sometimes spell checker would run even if turned off.
Version 4.5.3 (14 April 2009) includes the following changes:
- Haste makes waste! Revert a recent 'fix'.
Version 4.5.2 (14 April 2009) includes the following changes:
- Use re-entrant version of mbtowc in all code.
- Fix run-time breakage on Mac OSX and FreeBSD.
Version 4.5.1 (13 April 2009) includes the following changes:
- Fix Assertion failed: negative constituent length!
- Fix build break for Mac OSX.
- Force use of UTF-8 locale in the command-line tool.
Version 4.5.0 (10 April 2009) includes the following changes:
- Hack around missing SQLite3 pkgconfig on MacOS
- Fix adverbs: 'The motor ran hot', 'the door swung wide open', etc.
- Fix: 'at risk of breakdown', 'under threat of fire'
- Add regular-expression-based word guessing, from BioLG project. This provides support for many scientific/biomedical terms.
- Add spell-guessing for unknown words.
- Fix UTF8 support to be correctly thread-safe.
- BioLG: fix post-numbering: 'it started on day one'
- BioLG: add number ranges: 'it takes 2 to 3 times the effort'
- BioLG: assorted adverb fixes, typical of scientific prose.
- BioLG: initiate, attach, localize etc are optionally transitive.
- BioLG: allow fork, branch, splice, export, etc to take particles.
- BioLG: extended use of Greek letters in biomedical text.
- BioLG: support parsing of Roman numerals.
- BioLG: support Greek-letter-number combinations.
- Fix: 'she was singing', etc.
- Enable WordNet word-sense identification based on syntactical usage.
Version 4.4.3 (11 March 2009) includes the following changes:
- Look at JAVA_HOME to guess location of jni.h
- Add dictionary maintenance utility script.
- Add prototype German dictionary.
- Fix sentences containing measurements, lengths, speeds. etc.
- Add 1400 new adjectives, many of scientific origin.
- Add 250 international currencies, including special utf8 symbols.
- Performance optimization in sentence tokenization.
- Add 1100 male, female given names (from Bruce Wilcox).
- Cleanup parsing of "in-" date expressions, e.g. "in the 1940's".
- Cleanup of assorted compiler warnings.
- Fix handling of special utf8 characters in dictionary.
- Fix hang when parsing certain utf8 input strings.
- Make sure that most strings in the API are declared const char *
- Add option to command-line client to print disjunct strings.
Version 4.4.2 (15 January 2009) includes the following changes:
- Fix: 'The keys are gone', 'It is a gone desire'
- Fix: 'He poured me a glass', 'He wired me the money', etc.
- Initial rough-in of corpus statistics based parse ranking.
- Initial rough-in of corpus-guided word-sense identification.
- Fix for googlecode bugs 38, 39 (Java, Ruby failure to load)
- Add new API routine to return disjunct string.
Version 4.4.1 (15 December 2008) includes the following changes:
- Balance the dictionary tree; this speeds word-lookup slightly.
- New MSVC6 build files from Evgenii Philippov.
- Fix java server classes to pass along the link-grammar version number.
Version 4.4.0 (7 December 2008) includes the following changes:
- fix: recognize curly-single-quote ’ where straight quote can be used.
- recognize and explicitly ignore emoticon types.
- Include MSVC6 build files.
- Apply patch needed for Ruby bindings.
- fix: "Where did they come from?", per Viswanath IIIT
- fix: "Where did they go to?"
- fix: "It gives me peace of mind."
- fix: many, many incorrectly identified mass nouns.
- fix: ladle.v "molten hot" "piping hot"
- fix: "It's a shame that...", "The crux of the plan is that..."
- Performance improvements (about 11%) to prunce.c from Bruce Wilcox
- fix: "He eats with me nightly."
- Add new public api function: linkgrammar_get_version()
- MSVC9 build files from Borislav Iordanov
- Java network-efficient client-server classes from Borislav Iordanov
Version 4.3.9 (8 October 2008) includes the following changes:
- Issue 13: "John is altogether amazingly quick."
- Nonstandard spelling "unequivocably"
- Dictionary fixes for 'marginally', etc. "That one is marginally better"
- Issue 7: Dictionary fixes for 'done': "I am done working"
- dictionary entries for walk-up drivethru car-wash
- dictionary: "I am through being mad", "It was a through flight", etc.
- Issue 11: "You are doing well"
- Issue 3: "I asked Jim a question", "I told Jim a story"
- Passive subjects with objects: "I was told that crap, too" "...was asked..."
- Fixes for Apple Mac OSX (crash on non-executable stack)
- Early version of Filip Maric's boolean SAT solver
- fix: "He talked quietly of revolt."
- fix: "It consists mostly of sand.", "He talks, mostly of revolution."
- fix: "He talked mostly to Ann.", "He talks a lot."
- fix: than_usual: "He is taking longer than usual."
- fix: a batch of new verbs from Roman Khlupin
- fix: Fix crash on Apple Macintosh by correctly identifying the platform.
- fix: "San Gabriel" "Block Island" "Great Southern Bank" "de la Rente"
- fix: "I biked Johnson Creek."
Version 4.3.8 (14 August 2008) includes the following changes:
- Build fixes for Windows (missing bzero and thread-safe rand)
- Fix several mem leaks, one in the Java jni code.
- Fix crash of java6 jvm, due to blown stack limits.
Version 4.3.7 (8 August 2008) includes the following changes:
- Expanded list of given names to include those from 2005 US Census
- Export a new java jni function, to get the inflected word.
- Add a TO+ connector so that "I tend to agree" parses correctly.
- Makefile fixes for OS/X Leopard
Version 4.3.6 (21 July 2008) includes the following changes:
- Fixes for Windows MS Visual-C builds.
- Fix parsing of "He walked the dog.", "He sailed the boat."
- Add support for right-apostrophe (’) which is a non-ASCII UTF8 char.
- Add support for other non-ASCII UTF8 punctuation.
- Fix crash on printing constituent tree of certain long sentences.
- Avoid recursive error reporting for UTF8 dictionary errors.
- Clarify error logging and error printing.
- Add java getVersion() to return link-grammar version string.
- Add more numbers to dict (e.g. twenty-seven, bazillion, half-dozen, etc.)
- Foodstuffs: bagels, lox, tacos, guacamole, roe, neufchatel, mayo, etc.
- Weights and measures: megabytes, °C, km² etc.
- Performance improvements in printing of link-tree.
- Convert assert into warning when no canonical linkages can be found.
- Convert assert into warning when constituent andlist overflows.
- Provide additional checks for constituent overflows.
- Convert most error printfs into a formal error reporting system.
- Remove all globals, library is now thread-safe.
- Fix crash when sentence has square bracket, and doing constituents.
Version 4.3.5 (29 April 2008) includes the following changes:
- Added ant build file to create the link-grammar jar file.
- Fix regression in command-line client of multiple-parse display.
- Use MB_LEN_MAX, not MB_CUR_MAX for UTF8 support.
- Fix a WIN32 compiler regression (no in-line support in Windows).
- Fix error in handling of UTF8 dictionaries.
- Fix strncat() misuse in error.c
- Fix capitalization errors in country names.
- Fix parsing of "he angled left, he dodged left, he turned left".
- Don't build the JNI library if Java isn't found. Fixes build on Windows.
- Fix install bug for NetBSD systems.
- Pre-detected entities cannot participate in G links.
- There is no UTF8 support in windows, so stub it out.
- Fix crash in constituent output, bug #22 in googlecode bugtacker.
- Some small steps taken to eventually make library thread-safe.
- There are three constituent string styles, enable all three.
- Make the command-line flag errors less cryptic.
- Add readline (BSD editline) support.
- Rename "grammar-parse" to the more logical "link-parser".
- Small man page updates.
- Export and cost, link cost via public API.
Version 4.3.4 (16 March 2008) includes the following changes:
- Fix regression of handling of capitalization at the start of sentences.
- Fix dictionary search path so that it respects command-line input.
- Fix rare but nasty crash when parsing long sentences in panic mode.
- Add a method to set the dictionary path.
- Fix all remaining compiler warnings.
- Make parser capable of handling UTF8 strings and dictionaries.
- Ongoing minor expansion of the Lithuanian (lt) dictionary.
Version 4.3.3 (27 February 2008) includes the following changes:
- Missing java is a warning, not an error.
- man page for grammar-parse.
- Removed cruft from the dictionary open routines.
- configure tries to guess some non-standard jni.h locations.
- Split up java library exports, should help cygwin builds.
- Fix java library pre-linking bug.
- Minor English dictionary additions.
- Prototype Lithuanian (lt) dictionary.
Version 4.3.2 (2 February 2008) includes the following changes:
- Fix dictionary errors involving given names; e.g. any sentence with the name "John" in it.
- Minor Windows build fixes.
Version 4.3.1 (31 Jaunary 2008) includes the following changes:
- Merger of extensive dictionary additions from Peter Szolovits. This adds 15K new words, bringing the dictionary to 70K words total.
Version 4.3.0 (30 January 2008) includes the following changes:
- New link types (Ct, Cta, Rn, Rw) for comparatives, so as to link relative clauses: "John is bigger than Dave is", "John wants more cookies than Dave wants". The Rw link is used to link question words to the relative clauses that follow them.
- Dictionary Fixes for "Espresso is a coffee drink", "Teach me fetch", "I am pooped" as synonym for "I am tired", "Mother likes her", "Mommy loves me" and related. Also, directives involving "go": "Go play ball", "Go take a walk", "You and Rover go play with the ball."
- Dictionary support for external entity markup. This includes the recognition of personID0..personID60, dateID0..dateID60, organizationID0..organizationID60 and locationID0..locationID60 as appropriate words.
- Fixes of numerous compile-time warnings.
- Simple Java (JNI) bindings.
Version 4.2.5 (11 November 2007) includes the following changes:
- Fix for a security problem, involving a buffer overflow: CVE-2007-5395.

![[Logo]](/gfx/swish-a.jpg)