Matthew Honnibal
8775efbfdf
Merge pull request #1120 from raphael0202/pattern
...
Implementation of dependency pattern-matching algorithm
2017-10-24 11:44:23 +02:00
Matthew Honnibal
4ad24abb7e
Merge pull request #1447 from mayukh18/bengali_pronouns
...
added a few bengali pronouns
2017-10-24 10:22:17 +02:00
Matthew Honnibal
72a48dec14
Merge pull request #1454 from jnothman/patch-1
...
DOC "OP" key in token spec
2017-10-24 10:08:46 +02:00
Joel Nothman
80a9652617
DOC "OP" key in token spec
2017-10-24 15:48:22 +11:00
Ines Montani
00fc2db7ef
Merge pull request #1449 from jerbob92/feature/add-jerbob92-to-contributors
...
Add myself to contributors
2017-10-22 15:43:05 +02:00
Jeroen Bobbeldijk
5c7c08c2e3
Add myself to contributors
2017-10-22 15:35:46 +02:00
mayukh18
80edc905f7
added a few bengali pronouns
2017-10-22 13:16:39 +05:30
Ines Montani
2a0ab6fafa
Merge pull request #1435 from ramananbalakrishnan/update_to_array
...
Support single value for attribute list in doc.to_array
2017-10-20 13:21:48 +02:00
Matthew Honnibal
dbc276e3b2
Fix 'toupper()' -> 'upper()'
2017-10-20 13:02:13 +02:00
Matthew Honnibal
7a46792376
Fix compile error
...
Closures not allowed in cpdef
2017-10-20 11:53:47 +02:00
Matthew Honnibal
658536b5ce
Fix to_array compile error
2017-10-20 11:35:10 +02:00
Matthew Honnibal
c0799430a7
Make small changes to Doc.to_array
...
* Change type-check logic to 'hasattr' (Python type-checking is brittle)
* Small 'house style' edits, mostly making code more terse.
2017-10-20 11:17:00 +02:00
Ramanan Balakrishnan
fbccc8c87d
Update documentation on doc.to_array
2017-10-20 14:23:48 +05:30
Ramanan Balakrishnan
5941aa96a1
Support strings for attribute list in doc.to_array
2017-10-20 11:59:34 +05:30
Matthew Honnibal
45b41fcec8
Merge pull request #1441 from johnhaley81/patch-1
...
Fix Keras install in keras_parikeh_entailment README
2017-10-20 03:09:38 +02:00
John Haley
989814c4b6
Create johnhaley81.md
2017-10-19 09:11:16 -07:00
John Haley
44c61fde25
Fix Keras install in keras_parikeh_entailment
...
The master branch of Keras doesn't work with this example anymore so this pins Keras to version 1.2.2 for this example.
2017-10-19 08:56:28 -07:00
Ines Montani
0a0d8c8c9f
Merge pull request #1437 from Demfier/readme_update
...
Minor README update
2017-10-18 23:36:21 +02:00
demfier
772c8035f7
Sign SCA
2017-10-18 23:12:24 +05:30
demfier
0b9e1d3660
Merge branch 'master' of https://github.com/explosion/spaCy into readme_update
2017-10-18 22:33:42 +05:30
demfier
f39fc34c95
Add minor update in README
2017-10-18 22:32:58 +05:30
Ines Montani
e7b78370d9
Add note on origin of manually moved agreement
...
See 8a2d22222d
2017-10-18 14:41:38 +02:00
Ines Montani
3357588b9f
Create honnibal.md
2017-10-18 14:41:31 +02:00
Ines Montani
0b239ee646
Create ines.md
2017-10-18 14:37:08 +02:00
Ines Montani
9162ecb43f
Update CONTRIBUTOR_AGREEMENT.md
2017-10-18 14:36:19 +02:00
Matthew Honnibal
e787045cf5
Revert "filled up CONTRIBUTOR_AGREEMENT.md"
...
This reverts commit 8a2d22222d
.
2017-10-18 14:31:57 +02:00
Ines Montani
5a4b5b362c
Create shuvanon.md
2017-10-18 14:29:10 +02:00
Ines Montani
8bd9b05fdc
Update CONTRIBUTING.md
2017-10-18 14:13:36 +02:00
Ramanan Balakrishnan
b47b4e2654
Support single value for attribute list in doc.to_scalar conversion
2017-10-18 14:43:47 +05:30
Matthew Honnibal
cd9378c8f1
Merge pull request #1423 from yuukos/master
...
Fixed Russian tokenizer
2017-10-16 11:45:53 +02:00
Matthew Honnibal
6b0121091c
Merge pull request #1420 from polm/master
...
[ja] Stash tokenizer output for speed
2017-10-16 10:28:22 +02:00
yuukos
34e9c6ddc0
Merge remote-tracking branch 'origin/master'
2017-10-16 13:48:10 +07:00
yuukos
92931a2efd
Merge branch 'russian_language'
2017-10-16 13:46:28 +07:00
yuukos
241d19a3e6
fixed Russian Tokenizer
...
- added trailing space flags for tokens
2017-10-16 13:37:05 +07:00
Paul O'Leary McCann
71ae8013ec
[ja] Use user_details instead of a wrapper class
...
Instead of using a JapaneseDoc wrapper class to store Mecab output,
stash it in `user_data`. -POLM
2017-10-16 00:24:34 +09:00
Paul O'Leary McCann
43eedf73f2
[ja] Stash tokenizer output for speed
...
Before this commit, the Mecab tokenizer had to be called twice when
creating a Doc- once during tokenization and once during tagging. This
creates a JapaneseDoc wrapper class for Doc that stashes the parsed
tokenizer output to remove redundant processing. -POLM
2017-10-15 23:33:25 +09:00
Ines Montani
e00a6c08cf
Merge pull request #1418 from polm/master
...
Contributor agreement
2017-10-14 15:10:58 +02:00
Paul O'Leary McCann
a31d33be06
Contributor agreement
2017-10-14 19:28:04 +09:00
Ines Montani
4b5af8bd17
Merge pull request #1414 from yuukos/master
...
Adding Russian language support
2017-10-13 17:03:52 +02:00
Alex
95836abee1
Update CONTRIBUTORS.md
2017-10-13 21:02:19 +07:00
Alex
ce00405afc
Create yuukos.md
2017-10-13 21:00:15 +07:00
yuukos
6fb9d75bd2
fixed test with creating tokenizer
2017-10-13 15:51:03 +07:00
yuukos
a229b6e0de
added tests for Russian language
...
added tests of creating Russian Language instance and Russian tokenizer
2017-10-13 14:04:37 +07:00
yuukos
622b6d6270
updated Russian tokenizer
...
moved the trying to import pymorph into __init__
2017-10-13 13:57:29 +07:00
yuukos
f81dd284eb
updated spacy/__init__.py
...
registered russian language via set_lang_class
2017-10-12 22:28:34 +07:00
yuukos
7b9491679f
added russian language support
2017-10-12 22:24:20 +07:00
yuukos
2a78f4d634
updated .gitignore file
...
added excluding PyCharm's idea directory
2017-10-12 22:23:19 +07:00
Ines Montani
a06b84e7cc
Merge pull request #1407 from hscspring/patch-6
...
Update training.jade
2017-10-11 14:25:38 +02:00
Ines Montani
ffc2fef13c
Merge pull request #1411 from raphael0202/issue_1078
...
Resolve issue #1078 by simplifying URL pattern
2017-10-11 11:54:57 +02:00
Raphaël Bournhonesque
3452d6ce52
Resolve issue #1078 by simplifying URL pattern
...
- avoid catastrophic backtracking
- reduce character range of host name, domain name and TLD identifier
2017-10-11 11:24:00 +02:00