Commit Graph

  • 8a95c8b1f2 Appveyor clen build #4 maxirmx 2015-10-12 23:22:00 +0300
  • 3069929c25 Appveyor clean build #3 maxirmx 2015-10-12 23:18:42 +0300
  • ec8e622cec Appveyor clean build #2 maxirmx 2015-10-12 23:05:47 +0300
  • 298769cdf3 Appveyor clean build maxirmx 2015-10-12 23:01:37 +0300
  • 26b8dd47db Appveyor build maxirmx 2015-10-12 01:10:30 +0300
  • fc68d1c51c Appveyor build maxirmx 2015-10-12 01:04:21 +0300
  • c14c6f5579 Appveyour build maxirmx 2015-10-12 00:58:03 +0300
  • ded37538ce Appveyor build maxirmx 2015-10-12 00:51:33 +0300
  • 91ec215590 Appveyor build maxirmx 2015-10-12 00:50:34 +0300
  • b73c7ab45e Appveyor build maxirmx 2015-10-12 00:43:31 +0300
  • c1ca220769 Appveyor build maxirmx 2015-10-11 23:58:41 +0300
  • a0ffb856da Appveyor build maxirmx 2015-10-11 23:55:38 +0300
  • 8a91527952 Appveyor maxirmx 2015-10-11 23:50:06 +0300
  • deacdfd647 Appveyor - Removed noise maxirmx 2015-10-11 23:49:02 +0300
  • f5ebbbde80 Appveyour build maxirmx 2015-10-11 23:47:12 +0300
  • 7587f6170f Added Pythhon3 environment configuration batch maxirmx 2015-10-11 20:28:13 +0300
  • 5aeb8ac4e0 Added appveyor Python installation script maxirmx 2015-10-11 20:25:42 +0300
  • 3b79d67462 * Fix assertion in test_basic_create Matthew Honnibal 2015-10-12 00:48:18 +1100
  • aab71668cd Merge remote-tracking branch 'refs/remotes/honnibal/master' maxirmx 2015-10-11 16:47:49 +0300
  • afec8cac20 * Add more tests to probe mingw32 failure Matthew Honnibal 2015-10-11 22:40:04 +1100
  • 356181944a Merge remote-tracking branch 'refs/remotes/honnibal/master' maxirmx 2015-10-11 13:38:03 +0300
  • dba1daf597 * Add script to test loading different components Matthew Honnibal 2015-10-11 19:46:53 +1100
  • 1b2d7da819 Appveyor build maxirmx 2015-10-11 11:35:07 +0300
  • 5c9d438512 Appveyor build maxirmx 2015-10-11 11:28:53 +0300
  • e8ae2dc8a4 Appveyor windows build config maxirmx 2015-10-11 11:26:03 +0300
  • dfa19c757f Merge remote-tracking branch 'refs/remotes/honnibal/master' maxirmx 2015-10-11 11:06:22 +0300
  • 92f750cf8b * Use a gzipped frequencies file in init_model Matthew Honnibal 2015-10-11 06:59:44 +0200
  • cc92f3f0ed * Fix Matcher test Matthew Honnibal 2015-10-11 14:59:12 +1100
  • 1f8f81f0c8 * Fix missing import Matthew Honnibal 2015-10-11 14:38:21 +1100
  • 693dd06547 * Add basic, non-data dependent class creation tests, without depending on pytest. For use in debugging MS build issues, for Issue #132 Matthew Honnibal 2015-10-11 14:29:12 +1100
  • 9d949c857b More dirty Windows stuff - just for now maxirmx 2015-10-10 20:11:20 +0300
  • 8e03239ac5 Merge remote-tracking branch 'refs/remotes/honnibal/master' maxirmx 2015-10-10 17:38:06 +0300
  • 815994a212 MSVC x86-64 Pyton 2.7 dirty build maxirmx 2015-10-10 17:32:44 +0300
  • 0090f79fbd * Use lower case strings for dependency label names in symbols enum Matthew Honnibal 2015-10-10 22:59:14 +1100
  • 4c16307b10 * Fix parts_of_speech now that symbols list has been reformed Matthew Honnibal 2015-10-10 22:58:34 +1100
  • 8f0f47b9a6 * Use the symbols file in vocab instead of the symbols subfiles like attrs.pxd Matthew Honnibal 2015-10-10 22:12:06 +1100
  • 6b30d1cf7b * Remove qualified naming in symbols Matthew Honnibal 2015-10-10 22:11:38 +1100
  • eb23f2c31d * Map NO_TAG to NIL in parts_of_speech.pxd Matthew Honnibal 2015-10-10 22:11:20 +1100
  • b329c56c44 * Addmorphology symbols to morphology. May need to remove these as an enum. Matthew Honnibal 2015-10-10 22:10:58 +1100
  • 22bd0095f5 * Map empty string to NULL_ATTR in attrs Matthew Honnibal 2015-10-10 22:10:19 +1100
  • 7488821677 * Map NIL to empty string in tag map Matthew Honnibal 2015-10-10 22:09:50 +1100
  • 20e909d2bb * Fix empty values in attributes and parts of speech, so symbols align correctly with the StringStore Matthew Honnibal 2015-10-10 18:27:03 +1100
  • e18fbcb604 * Allow SPACY_DATA environment variable in website tests Matthew Honnibal 2015-10-10 17:58:57 +1100
  • 1cac36bf1c * Add symbols to the vocab before reading the strings, so that they line up correctly Matthew Honnibal 2015-10-10 17:58:29 +1100
  • 94bafc1417 * Rename ATTR_IDS to attrs.IDS. Rename ATTR_NAMES to attrs.NAMES. Rename UNIV_POS_IDS to parts_of_speech.IDS Matthew Honnibal 2015-10-10 17:55:55 +1100
  • 3cea417852 * Enumerate all symbols in one file Matthew Honnibal 2015-10-07 00:41:17 +1100
  • 4bbd1388bd * Whitespace Matthew Honnibal 2015-10-07 00:40:22 +1100
  • 064bd69ad0 * Refactor symbols, so that frequency rank can be derived from the orth id of a word. Matthew Honnibal 2015-10-07 00:39:50 +1100
  • 08e29519a6 * Add test for how spaces are attached by the parser. Matthew Honnibal 2015-10-10 16:03:13 +1100
  • dfbcff2ff1 * Revert codecs/io change to strings.pyx, as it seemed to cause an error? Will investigate. Matthew Honnibal 2015-10-10 15:54:55 +1100
  • bdcb8d695c * Add non-breaking space to specials.json Matthew Honnibal 2015-10-10 15:54:06 +1100
  • 9dd2f25c74 * Fix Issue #131: Force whitespace characters to attach syntactically to previous token, and ensure they cannot serve as stand-alone 'sentence' units. Matthew Honnibal 2015-10-10 15:53:30 +1100
  • 8b39feefbe * Add dependency post-process rule to ensure spaces are attached to neighbouring tokens, so that they can't be sentence boundaries Matthew Honnibal 2015-10-10 15:32:13 +1100
  • 1521cf25c9 * Fix merge problem in test_parse_navigate Matthew Honnibal 2015-10-10 15:04:01 +1100
  • c12d36d5f4 * Fix quote marks in lemma_rules Matthew Honnibal 2015-10-10 15:03:36 +1100
  • 2153067958 * Fix use of io in strings.pyx Matthew Honnibal 2015-10-10 15:03:12 +1100
  • ec874247b5 Merge branch 'master' of ssh://github.com/honnibal/spaCy Matthew Honnibal 2015-10-10 14:23:51 +1100
  • 30de4135c9 * Fix merge problem Matthew Honnibal 2015-10-10 14:22:32 +1100
  • dc393a5f1d Merge pull request #126 from tomtung/master Matthew Honnibal 2015-10-10 14:14:57 +1100
  • 6ea8f99a10 Merge branch 'alvations-master' Matthew Honnibal 2015-10-10 14:13:24 +1100
  • 83dccf0fd7 * Use io module insteads of deprecated codecs module Matthew Honnibal 2015-10-10 14:13:01 +1100
  • 55cd7008bb Merge branch 'master' of ssh://github.com/honnibal/spaCy Matthew Honnibal 2015-10-10 14:07:55 +1100
  • 57b3cd4661 * Add smart-quotes to lemma rules Matthew Honnibal 2015-10-10 14:06:46 +1100
  • 7e7f28e1fd * Add smart-quote possessive marker in generate_specials Matthew Honnibal 2015-10-10 14:06:09 +1100
  • 41c50e509c Merge pull request #137 from henningpeters/master Matthew Honnibal 2015-10-10 01:40:29 +1100
  • 8b8d048385 Merge pull request #135 from henningpeters/patch-1 Matthew Honnibal 2015-10-10 01:40:15 +1100
  • d31c911f83 Merge pull request #136 from henningpeters/patch-2 Matthew Honnibal 2015-10-10 01:40:00 +1100
  • 7a47c0c872 push version Henning Peters 2015-10-09 16:37:57 +0200
  • 88b2f7ea5d push version and add spacy channel Henning Peters 2015-10-09 16:30:23 +0200
  • 876fc99c44 cleanup Henning Peters 2015-10-09 16:11:56 +0200
  • a3dfe2b901 * Increment data version 0.94 Matthew Honnibal 2015-10-09 13:26:17 +0200
  • af8d0a2a09 * Increment version Matthew Honnibal 2015-10-09 12:42:41 +0200
  • 3bf50ab830 * Ensure the fabfile prebuild command installs pytest Matthew Honnibal 2015-10-09 20:57:47 +1100
  • 599f739ddb * Fix smart quote lemma test Matthew Honnibal 2015-10-09 20:51:28 +1100
  • 5682439d1e * Remove em dash test from test_lemmatizer, as em dashes are now handled in specials.json Matthew Honnibal 2015-10-09 20:24:21 +1100
  • f35632e2e5 * Remove SBD print statement in train, after SBD evaluation was removed from Scorer Matthew Honnibal 2015-10-09 11:08:58 +0200
  • 1f90502ce8 * Fix website/test_home for Python 3 Matthew Honnibal 2015-10-09 11:08:31 +0200
  • caff4638c9 * Fix website/test_api.py for Python 3 Matthew Honnibal 2015-10-09 11:08:12 +0200
  • a510858f5a * Pretty-print specials.json, and add the em dash Matthew Honnibal 2015-10-09 11:07:45 +0200
  • 49600a44a8 * Fix trailing comma in lemma_rules.json Matthew Honnibal 2015-10-09 11:06:57 +0200
  • 0e92e8574a * Fix pos tag in em-dash in specials Matthew Honnibal 2015-10-09 11:06:37 +0200
  • d341443282 * Remove em-dash from lemma rules. Handle instead in specials. Matthew Honnibal 2015-10-09 10:27:13 +0200
  • b6047afe4c * Fix punctuation lemma rules, to resolve Issue #130 Matthew Honnibal 2015-10-09 10:25:37 +0200
  • 393a13d1af * Add unicode em dash to specials.json, so that we can control what POS tag it gets. This way we can prevent sentence boundary detection errors, to address Issue #130. Matthew Honnibal 2015-10-09 19:24:33 +1100
  • 1490feda29 * Make generate_specials pretty-print the specials.json file Matthew Honnibal 2015-10-09 19:23:47 +1100
  • 1842a53e73 * Lemmatize smart quotes as plain quotes Matthew Honnibal 2015-10-09 19:09:36 +1100
  • 2d9e5bf566 * Allow punctuation to be lemmatized Matthew Honnibal 2015-10-09 19:02:42 +1100
  • 5332c0b697 * Add support for punctuation lemmatization, to handle unicode characters. This should help in addressing Issue #130 Matthew Honnibal 2015-10-09 18:44:21 +1100
  • b71ba2eed5 * Add tests for unicode puncuation character lemmatization Matthew Honnibal 2015-10-09 18:43:14 +1100
  • 9a6811acc4 Merge remote-tracking branch 'upstream/master' Yubing (Tom) Dong 2015-10-08 22:53:02 -0700
  • 0e13f18ea4 remove compile warning noise Henning Peters 2015-10-09 07:23:39 +0200
  • c5b2c4ead8 * Don't build old license page Matthew Honnibal 2015-10-09 14:58:45 +1100
  • 4bae38128d * Remove license page from website in repo Matthew Honnibal 2015-10-09 14:58:34 +1100
  • 00c1992503 * Mark tests that require models Matthew Honnibal 2015-10-09 14:48:14 +1100
  • dea40cfec3 * Mark tests that require models Matthew Honnibal 2015-10-09 14:37:48 +1100
  • 5031440c35 * Mark tests that require models Matthew Honnibal 2015-10-09 14:29:28 +1100
  • 76936a3456 * Mark tests that require models Matthew Honnibal 2015-10-09 14:19:07 +1100
  • 7b340912d4 * Mark tests that require models Matthew Honnibal 2015-10-09 14:09:26 +1100
  • 20b8c3e281 * Mark tests that require models Matthew Honnibal 2015-10-09 13:58:01 +1100
  • b125289f30 * Fix type declaration in asciied function Matthew Honnibal 2015-10-09 13:46:57 +1100