[Logo]

Link Grammar Parser Older Changes

Summary of Older Changes

Version 4.5.10 (25 August 2009)

  • Be sure to link with -lm

Version 4.5.9 (25 August 2009)

  • Modify error messages to indicate that they are from link-grammar.
  • Add missing Java files that were forgotten last time around.
  • Add greeting to command-line client startup.
  • Print disjunct cost also, when requesting disjunct printing.
  • Add missing color names as mass nouns.
  • Fix: Reclassify musical instruments: "He plays piano"
  • Add experimental word-clustering system.
  • Add CMake build file
  • Fix: "It takes longer than that."
  • Fix: "He has done very well."
  • Fix: a dozen optionally transitive verbs (swim, kill, etc.)
  • Fix: "He's out running."
  • Fix: "suddenly" is a "manner adverb", not a clausal adverb.
  • Fix: Use Pg links to gerunds: "He feared hitting the wall."
  • Fix: assorted numerical-range bugs.
  • Fix: prep modifiers with distances: "It is a few miles out"
  • Fix: Spelled-out dates: "It started in nineteen twelve"
  • Fix: Misc date, time expression parsing e.g "Zero hour is here."
  • Fix: Misc words, "ordered list", "screened out"
  • Fix: Post-fixed numbers can act as determiners.
  • Fix: "We bought the last 50 ft. of cable."
  • Fix: opening directives to imperatives: "Finally, move it back."
  • Fix: Improved simple equation parsing support.
  • Fix: Add misc fixes from BioLG that were previously overlooked.
  • Fix: "favorite" can take determiner "a" ("a favorite place")
  • Fix: assorted clausal complements: "The emperor ordered it done."
  • Fix: ordinals: "First on our list is ..."
  • Fix: verb modifier "some of the time", "most places"
  • Fix: Sit, stand take modifiers: "he stood still"

Version 4.5.8 (2 July 2009) includes the following changes:

  • Fix: 'than anticipated', 'than was anticipated', etc.
  • Fix: 'saw the wood'
  • Fix: sometimes commas are used as if they were semicolons.
  • Fix: 'We have quite enough work already, thank you!'
  • Fix: allow 'and' as conjunction in entity names.
  • Fix: 'I stared him down', 'They shouted him down', 'booed off'
  • Fix: 'sound him out', 'look him over'
  • Fix: 'Somewhere in the distance'
  • Stub out list of names given to both men and women, to avoid duplication.
  • Fix: 'I think so, too'
  • Fixes for compilation under Cygwin.
  • From Boris Iordanov: fixes to JSON java code.
  • From Boris Iordanov: new java remote client code.
  • Fix: Biological texts commonly have adj-noun-adj-noun chains

Version 4.5.7 (4 June 2009) includes the following changes:

  • Fix 'make install' for windows (abi bug 12049)
  • Fix multi-threaded bug when TRACK_SPACE_USAGE is defined.
  • Add './configure --enable-mudflap' just for fun...
  • Fix: "Walk tall", "Think quick"
  • Fix: "... part no. 1234-56A"
  • Fix regression from BioLG merge: "It cost $14 million."
  • Fix come/came: "The dog came running..."
  • Fix year abbreviations: "He drove a souped-up '57 Chevy"
  • Fix sit, stand: "The dog stood still"
  • Fix act up, act out: "He is acting out." "The motor is acting up."
  • Fix notoriously, poorly: "The store was poorly stocked".
  • Fix: "strong" can be adverb
  • Add support for recognizing basic time zones during parsing.
  • Fix: verbs acting as adjectival modifiers: "a very politicized deal."
  • Fix: ".. nearly so well", "...almost so well".
  • Fix financial ranges: "It will cost $10 million to $20 million to build."
  • Expand handling of capitalized words that appear in entity names.
  • Expand the list of characters that are recognized as quotes.
  • Support usage of yes, no as sentence openers.
  • Better support for directives, commands.
  • Fix: "Ash Wednesday", "Fat Tuesday", etc.
  • Fix: post-verbal adj: "she wiped the table dry"
  • Fix: wish: "she wished me a happy birthday"

Version 4.5.6 (24 May 2009) includes the following changes:

  • Bugfix: fix non-thread-safe usage.
  • Changes to enable MinGW/Windows to compile.
  • Update of MSVC6 build files
  • Fix: pizza, fries, chopsticks.
  • Export word-sense database to Java apps.
  • Fix: "Was the man drunk or crazy or both?"

Version 4.5.5 (10 May 2009) includes the following changes:

  • Bugfix: crash for zero-length sentences.

Version 4.5.4 (9 May 2009) includes the following changes:

  • Fix: "sleep in": "A bed is something you sleep in."
  • Fix: "drinking": "Let's go drinking."
  • Fix go+bare infinitive: "Let's go shop", "Let's go swim"
  • Fix: "Let's go for a swim." "Let's go for a smoke".
  • Fix: "Let's not" "Let's not go" "Let's not cry"
  • Fix: ... is : "All he ever does is complain."
  • Fix: "You will die young/happy/unhappy"
  • Fix: "You should exercise to stay fit."
  • Fix: "We danced 'til dawn."
  • Fix: "tell off": "She had told him off."
  • Bugfix: sometimes spell checker would run even if turned off.

Version 4.5.3 (14 April 2009) includes the following changes:

  • Haste makes waste! Revert a recent 'fix'.

Version 4.5.2 (14 April 2009) includes the following changes:

  • Use re-entrant version of mbtowc in all code.
  • Fix run-time breakage on Mac OSX and FreeBSD.

Version 4.5.1 (13 April 2009) includes the following changes:

  • Fix Assertion failed: negative constituent length!
  • Fix build break for Mac OSX.
  • Force use of UTF-8 locale in the command-line tool.

Version 4.5.0 (10 April 2009) includes the following changes:

  • Hack around missing SQLite3 pkgconfig on MacOS
  • Fix adverbs: 'The motor ran hot', 'the door swung wide open', etc.
  • Fix: 'at risk of breakdown', 'under threat of fire'
  • Add regular-expression-based word guessing, from BioLG project. This provides support for many scientific/biomedical terms.
  • Add spell-guessing for unknown words.
  • Fix UTF8 support to be correctly thread-safe.
  • BioLG: fix post-numbering: 'it started on day one'
  • BioLG: add number ranges: 'it takes 2 to 3 times the effort'
  • BioLG: assorted adverb fixes, typical of scientific prose.
  • BioLG: initiate, attach, localize etc are optionally transitive.
  • BioLG: allow fork, branch, splice, export, etc to take particles.
  • BioLG: extended use of Greek letters in biomedical text.
  • BioLG: support parsing of Roman numerals.
  • BioLG: support Greek-letter-number combinations.
  • Fix: 'she was singing', etc.
  • Enable WordNet word-sense identification based on syntactical usage.

Version 4.4.3 (11 March 2009) includes the following changes:

  • Look at JAVA_HOME to guess location of jni.h
  • Add dictionary maintenance utility script.
  • Add prototype German dictionary.
  • Fix sentences containing measurements, lengths, speeds. etc.
  • Add 1400 new adjectives, many of scientific origin.
  • Add 250 international currencies, including special utf8 symbols.
  • Performance optimization in sentence tokenization.
  • Add 1100 male, female given names (from Bruce Wilcox).
  • Cleanup parsing of "in-" date expressions, e.g. "in the 1940's".
  • Cleanup of assorted compiler warnings.
  • Fix handling of special utf8 characters in dictionary.
  • Fix hang when parsing certain utf8 input strings.
  • Make sure that most strings in the API are declared const char *
  • Add option to command-line client to print disjunct strings.

Version 4.4.2 (15 January 2009) includes the following changes:

  • Fix: 'The keys are gone', 'It is a gone desire'
  • Fix: 'He poured me a glass', 'He wired me the money', etc.
  • Initial rough-in of corpus statistics based parse ranking.
  • Initial rough-in of corpus-guided word-sense identification.
  • Fix for googlecode bugs 38, 39 (Java, Ruby failure to load)
  • Add new API routine to return disjunct string.

Version 4.4.1 (15 December 2008) includes the following changes:

  • Balance the dictionary tree; this speeds word-lookup slightly.
  • New MSVC6 build files from Evgenii Philippov.
  • Fix java server classes to pass along the link-grammar version number.

Version 4.4.0 (7 December 2008) includes the following changes:

  • fix: recognize curly-single-quote ’ where straight quote can be used.
  • recognize and explicitly ignore emoticon types.
  • Include MSVC6 build files.
  • Apply patch needed for Ruby bindings.
  • fix: "Where did they come from?", per Viswanath IIIT
  • fix: "Where did they go to?"
  • fix: "It gives me peace of mind."
  • fix: many, many incorrectly identified mass nouns.
  • fix: ladle.v "molten hot" "piping hot"
  • fix: "It's a shame that...", "The crux of the plan is that..."
  • Performance improvements (about 11%) to prunce.c from Bruce Wilcox
  • fix: "He eats with me nightly."
  • Add new public api function: linkgrammar_get_version()
  • MSVC9 build files from Borislav Iordanov
  • Java network-efficient client-server classes from Borislav Iordanov

Version 4.3.9 (8 October 2008) includes the following changes:

  • Issue 13: "John is altogether amazingly quick."
  • Nonstandard spelling "unequivocably"
  • Dictionary fixes for 'marginally', etc. "That one is marginally better"
  • Issue 7: Dictionary fixes for 'done': "I am done working"
  • dictionary entries for walk-up drivethru car-wash
  • dictionary: "I am through being mad", "It was a through flight", etc.
  • Issue 11: "You are doing well"
  • Issue 3: "I asked Jim a question", "I told Jim a story"
  • Passive subjects with objects: "I was told that crap, too" "...was asked..."
  • Fixes for Apple Mac OSX (crash on non-executable stack)
  • Early version of Filip Maric's boolean SAT solver
  • fix: "He talked quietly of revolt."
  • fix: "It consists mostly of sand.", "He talks, mostly of revolution."
  • fix: "He talked mostly to Ann.", "He talks a lot."
  • fix: than_usual: "He is taking longer than usual."
  • fix: a batch of new verbs from Roman Khlupin
  • fix: Fix crash on Apple Macintosh by correctly identifying the platform.
  • fix: "San Gabriel" "Block Island" "Great Southern Bank" "de la Rente"
  • fix: "I biked Johnson Creek."

Version 4.3.8 (14 August 2008) includes the following changes:

  • Build fixes for Windows (missing bzero and thread-safe rand)
  • Fix several mem leaks, one in the Java jni code.
  • Fix crash of java6 jvm, due to blown stack limits.

Version 4.3.7 (8 August 2008) includes the following changes:

  • Expanded list of given names to include those from 2005 US Census
  • Export a new java jni function, to get the inflected word.
  • Add a TO+ connector so that "I tend to agree" parses correctly.
  • Makefile fixes for OS/X Leopard

Version 4.3.6 (21 July 2008) includes the following changes:

  • Fixes for Windows MS Visual-C builds.
  • Fix parsing of "He walked the dog.", "He sailed the boat."
  • Add support for right-apostrophe (’) which is a non-ASCII UTF8 char.
  • Add support for other non-ASCII UTF8 punctuation.
  • Fix crash on printing constituent tree of certain long sentences.
  • Avoid recursive error reporting for UTF8 dictionary errors.
  • Clarify error logging and error printing.
  • Add java getVersion() to return link-grammar version string.
  • Add more numbers to dict (e.g. twenty-seven, bazillion, half-dozen, etc.)
  • Foodstuffs: bagels, lox, tacos, guacamole, roe, neufchatel, mayo, etc.
  • Weights and measures: megabytes, °C, km² etc.
  • Performance improvements in printing of link-tree.
  • Convert assert into warning when no canonical linkages can be found.
  • Convert assert into warning when constituent andlist overflows.
  • Provide additional checks for constituent overflows.
  • Convert most error printfs into a formal error reporting system.
  • Remove all globals, library is now thread-safe.
  • Fix crash when sentence has square bracket, and doing constituents.

Version 4.3.5 (29 April 2008) includes the following changes:

  • Added ant build file to create the link-grammar jar file.
  • Fix regression in command-line client of multiple-parse display.
  • Use MB_LEN_MAX, not MB_CUR_MAX for UTF8 support.
  • Fix a WIN32 compiler regression (no in-line support in Windows).
  • Fix error in handling of UTF8 dictionaries.
  • Fix strncat() misuse in error.c
  • Fix capitalization errors in country names.
  • Fix parsing of "he angled left, he dodged left, he turned left".
  • Don't build the JNI library if Java isn't found. Fixes build on Windows.
  • Fix install bug for NetBSD systems.
  • Pre-detected entities cannot participate in G links.
  • There is no UTF8 support in windows, so stub it out.
  • Fix crash in constituent output, bug #22 in googlecode bugtacker.
  • Some small steps taken to eventually make library thread-safe.
  • There are three constituent string styles, enable all three.
  • Make the command-line flag errors less cryptic.
  • Add readline (BSD editline) support.
  • Rename "grammar-parse" to the more logical "link-parser".
  • Small man page updates.
  • Export and cost, link cost via public API.

Version 4.3.4 (16 March 2008) includes the following changes:

  • Fix regression of handling of capitalization at the start of sentences.
  • Fix dictionary search path so that it respects command-line input.
  • Fix rare but nasty crash when parsing long sentences in panic mode.
  • Add a method to set the dictionary path.
  • Fix all remaining compiler warnings.
  • Make parser capable of handling UTF8 strings and dictionaries.
  • Ongoing minor expansion of the Lithuanian (lt) dictionary.

Version 4.3.3 (27 February 2008) includes the following changes:

  • Missing java is a warning, not an error.
  • man page for grammar-parse.
  • Removed cruft from the dictionary open routines.
  • configure tries to guess some non-standard jni.h locations.
  • Split up java library exports, should help cygwin builds.
  • Fix java library pre-linking bug.
  • Minor English dictionary additions.
  • Prototype Lithuanian (lt) dictionary.

Version 4.3.2 (2 February 2008) includes the following changes:

  • Fix dictionary errors involving given names; e.g. any sentence with the name "John" in it.
  • Minor Windows build fixes.

Version 4.3.1 (31 Jaunary 2008) includes the following changes:

  • Merger of extensive dictionary additions from Peter Szolovits. This adds 15K new words, bringing the dictionary to 70K words total.

Version 4.3.0 (30 January 2008) includes the following changes:

  • New link types (Ct, Cta, Rn, Rw) for comparatives, so as to link relative clauses: "John is bigger than Dave is", "John wants more cookies than Dave wants". The Rw link is used to link question words to the relative clauses that follow them.
  • Dictionary Fixes for "Espresso is a coffee drink", "Teach me fetch", "I am pooped" as synonym for "I am tired", "Mother likes her", "Mommy loves me" and related. Also, directives involving "go": "Go play ball", "Go take a walk", "You and Rover go play with the ball."
  • Dictionary support for external entity markup. This includes the recognition of personID0..personID60, dateID0..dateID60, organizationID0..organizationID60 and locationID0..locationID60 as appropriate words.
  • Fixes of numerous compile-time warnings.
  • Simple Java (JNI) bindings.

Version 4.2.5 (11 November 2007) includes the following changes:

  • Fix for a security problem, involving a buffer overflow: CVE-2007-5395.