Commit Graph

2294 Commits

Author SHA1 Message Date
Matthew Honnibal
6ea1601e93 * Add script to train models off the UD treebanks. Note that the UD data is restricted to research purposes only, and should only be used to train models for academic experiments. 2015-10-08 12:01:08 +11:00
Chris DuBois
e095faa785 Add contributor. 2015-10-07 17:55:46 -07:00
chrisdubois
cc47b8ad6a Fix size of allocation when creating a pattern
Each pattern object currently contains two AttrValues rather than just one.
2015-10-07 10:32:55 -07:00
Yubing (Tom) Dong
0f601b8b75 Update docstring of Doc.__getitem__ 2015-10-07 01:27:28 -07:00
Yubing (Tom) Dong
3fd3bc79aa Refactor to remove duplicate slicing logic 2015-10-07 01:25:35 -07:00
Yubing (Tom) Dong
97685aecb7 Add slicing support to Span 2015-10-06 02:45:49 -07:00
Yubing (Tom) Dong
5cc2f2b01a Test simple indexing for Span 2015-10-06 02:41:46 -07:00
Yubing (Tom) Dong
ef2af20cd3 Make Doc's slicing behavior conform to Python conventions 2015-10-06 02:41:28 -07:00
Yubing (Tom) Dong
2fc33e8024 Allow step=1 when slicing a Doc 2015-10-06 00:57:05 -07:00
Yubing (Tom) Dong
73566899bf Add Doc slicing tests 2015-10-06 00:57:01 -07:00
Matthew Honnibal
b228a8f4a6 * Remove spacy/en/attrs 2015-10-06 16:20:46 +11:00
Matthew Honnibal
693677fd8d * Prepare to remove en/attrx file, now that moving to symbols.pyx 2015-10-06 16:20:13 +11:00
Matthew Honnibal
63bd17135f * Whitespace 2015-10-06 10:37:07 +11:00
Matthew Honnibal
e7c31f7eae * Tweak information extraction example 2015-10-06 10:35:49 +11:00
Matthew Honnibal
c503654ec1 * Update bin/parser/train for printing output. 2015-10-06 10:35:22 +11:00
Matthew Honnibal
3d9f41c2c9 * Add LookupError for better error reporting in Vocab 2015-10-06 10:34:59 +11:00
Matthew Honnibal
ecc5281b36 * Remove en/pos.pyx, as the tagger code now lives in spacy/tagger.pyx 2015-10-06 10:12:08 +11:00
Matthew Honnibal
e4ba8a4b5a * Add multi word matching code 2015-10-06 09:06:52 +11:00
Matthew Honnibal
262c215b55 examples/information_extraction.py
* Add very simple information extraction snippet.
2015-10-01 22:27:57 +10:00
Matthew Honnibal
fd72b8b282 * Add a test for Issue #118: Matcher behaves unpredictably with overlapping entities 2015-10-01 16:21:00 +10:00
alvations
8caedba42a caught more codecs.open -> io.open 2015-09-30 20:20:09 +02:00
alvations
764bdc62e7 caught another codecs.open 2015-09-30 20:16:52 +02:00
alvations
8199012d26 changing deprecated codecs.open to io.open =) 2015-09-30 20:10:15 +02:00
Matthew Honnibal
73928001ed * Set details(open=true) on docs while we redesign 2015-09-30 11:48:15 +10:00
Matthew Honnibal
04c92d4f89 * Update comparisons 2015-09-29 23:07:00 +10:00
Matthew Honnibal
e7a7f3bd63 * Fix indentation error in API docs. 2015-09-29 23:05:04 +10:00
Matthew Honnibal
bf4d30c5b6 * Fix test failures in test_api 2015-09-29 23:04:20 +10:00
Matthew Honnibal
87e6186828 * Rename _seq to doc attribute in Span 2015-09-29 23:03:55 +10:00
Matthew Honnibal
ab694b0364 * Fix open-bounded slice indices. 2015-09-29 23:03:09 +10:00
Matthew Honnibal
e562f504ee * Fix license metadata in setup.py 2015-09-29 23:02:37 +10:00
Matthew Honnibal
69f0c2cd26 * Fix typo in README 2015-09-29 23:02:08 +10:00
Matthew Honnibal
a6ced80c0c * Fix Issue #116: Misleading handling of True value in Language.__init__. 2015-09-29 20:54:12 +10:00
Matthew Honnibal
f9d2a5b651 * Fix issue #112: Replace unidecode with text-unidecode, to avoid license problems. 2015-09-28 23:40:18 +10:00
Matthew Honnibal
37729b9592 Merge pull request #111 from henningpeters/master
add doctests for website 'api'-section
2015-09-28 22:40:48 +10:00
Henning Peters
b364be73dc fix merge 2015-09-28 14:36:17 +02:00
Henning Peters
5dfd2df686 add doctests for website 'api'-section (merge) 2015-09-28 14:35:11 +02:00
Henning Peters
f0360bf59d add doctests for website 'api'-section 2015-09-28 14:22:13 +02:00
Matthew Honnibal
c03e74272b * Remove extraneous file. 2015-09-28 21:54:44 +10:00
Matthew Honnibal
d8276b3792 * Move test_home to within tests/ 2015-09-28 19:34:34 +10:00
Matthew Honnibal
5ca1646d8a * Mark model-requiring tests 2015-09-28 18:07:11 +10:00
Matthew Honnibal
c3164f9cbe Merge pull request #107 from henningpeters/master
doctests for website: 'home'-section
2015-09-28 17:46:52 +10:00
Matthew Honnibal
2c33a96ac3 Merge pull request #99 from rw/patch-1
Force SSL for downloading English language data.
2015-09-28 17:46:26 +10:00
Henning Peters
936edea425 doctests for website: 'home'-section 2015-09-28 02:39:14 +02:00
Henning Peters
d1850dcbf7 Merge branch 'master' of https://github.com/honnibal/spaCy 2015-09-28 01:13:02 +02:00
Matthew Honnibal
6982ed74ab * Update readme with license change 2015-09-28 07:38:12 +10:00
Matthew Honnibal
098dc2de8b * Change from AGPL to MIT 2015-09-28 07:37:12 +10:00
Matthew Honnibal
412ce6407a * Update Website 2015-09-28 07:35:40 +10:00
Matthew Honnibal
a6e2a44283 * Add clarifying comment 2015-09-27 18:17:41 +10:00
Matthew Honnibal
8c3ec4c140 * Add comments clarifying output of matcher example 2015-09-27 18:13:45 +10:00
Matthew Honnibal
c17e2f2f20 * Add examples for Matcher, to answer Issue #105. TODO: Integrate into docs properly. 2015-09-27 18:08:00 +10:00