Commit Graph

5276 Commits

Author SHA1 Message Date
Matthew Honnibal
4ad24abb7e Merge pull request #1447 from mayukh18/bengali_pronouns
added a few bengali pronouns
2017-10-24 10:22:17 +02:00
Matthew Honnibal
72a48dec14 Merge pull request #1454 from jnothman/patch-1
DOC "OP" key in token spec
2017-10-24 10:08:46 +02:00
Joel Nothman
80a9652617 DOC "OP" key in token spec 2017-10-24 15:48:22 +11:00
Ines Montani
00fc2db7ef Merge pull request #1449 from jerbob92/feature/add-jerbob92-to-contributors
Add myself to contributors
2017-10-22 15:43:05 +02:00
Jeroen Bobbeldijk
5c7c08c2e3 Add myself to contributors 2017-10-22 15:35:46 +02:00
mayukh18
80edc905f7 added a few bengali pronouns 2017-10-22 13:16:39 +05:30
Ines Montani
2a0ab6fafa Merge pull request #1435 from ramananbalakrishnan/update_to_array
Support single value for attribute list in doc.to_array
2017-10-20 13:21:48 +02:00
Matthew Honnibal
dbc276e3b2 Fix 'toupper()' -> 'upper()' 2017-10-20 13:02:13 +02:00
Matthew Honnibal
7a46792376 Fix compile error
Closures not allowed in cpdef
2017-10-20 11:53:47 +02:00
Matthew Honnibal
658536b5ce Fix to_array compile error 2017-10-20 11:35:10 +02:00
Matthew Honnibal
c0799430a7 Make small changes to Doc.to_array
* Change type-check logic to 'hasattr' (Python type-checking is brittle)
* Small 'house style' edits, mostly making code more terse.
2017-10-20 11:17:00 +02:00
Ramanan Balakrishnan
fbccc8c87d
Update documentation on doc.to_array 2017-10-20 14:23:48 +05:30
Ramanan Balakrishnan
5941aa96a1
Support strings for attribute list in doc.to_array 2017-10-20 11:59:34 +05:30
Matthew Honnibal
45b41fcec8 Merge pull request #1441 from johnhaley81/patch-1
Fix Keras install in keras_parikeh_entailment README
2017-10-20 03:09:38 +02:00
John Haley
989814c4b6 Create johnhaley81.md 2017-10-19 09:11:16 -07:00
John Haley
44c61fde25 Fix Keras install in keras_parikeh_entailment
The master branch of Keras doesn't work with this example anymore so this pins Keras to version 1.2.2 for this example.
2017-10-19 08:56:28 -07:00
Ines Montani
0a0d8c8c9f Merge pull request #1437 from Demfier/readme_update
Minor README update
2017-10-18 23:36:21 +02:00
demfier
772c8035f7 Sign SCA 2017-10-18 23:12:24 +05:30
demfier
0b9e1d3660 Merge branch 'master' of https://github.com/explosion/spaCy into readme_update 2017-10-18 22:33:42 +05:30
demfier
f39fc34c95 Add minor update in README 2017-10-18 22:32:58 +05:30
Ines Montani
e7b78370d9 Add note on origin of manually moved agreement
See 8a2d22222d
2017-10-18 14:41:38 +02:00
Ines Montani
3357588b9f Create honnibal.md 2017-10-18 14:41:31 +02:00
Ines Montani
0b239ee646 Create ines.md 2017-10-18 14:37:08 +02:00
Ines Montani
9162ecb43f Update CONTRIBUTOR_AGREEMENT.md 2017-10-18 14:36:19 +02:00
Matthew Honnibal
e787045cf5 Revert "filled up CONTRIBUTOR_AGREEMENT.md"
This reverts commit 8a2d22222d.
2017-10-18 14:31:57 +02:00
Ines Montani
5a4b5b362c Create shuvanon.md 2017-10-18 14:29:10 +02:00
Ines Montani
8bd9b05fdc Update CONTRIBUTING.md 2017-10-18 14:13:36 +02:00
Ramanan Balakrishnan
b47b4e2654
Support single value for attribute list in doc.to_scalar conversion 2017-10-18 14:43:47 +05:30
Matthew Honnibal
cd9378c8f1 Merge pull request #1423 from yuukos/master
Fixed Russian tokenizer
2017-10-16 11:45:53 +02:00
Matthew Honnibal
6b0121091c Merge pull request #1420 from polm/master
[ja] Stash tokenizer output for speed
2017-10-16 10:28:22 +02:00
yuukos
34e9c6ddc0 Merge remote-tracking branch 'origin/master' 2017-10-16 13:48:10 +07:00
yuukos
92931a2efd Merge branch 'russian_language' 2017-10-16 13:46:28 +07:00
yuukos
241d19a3e6 fixed Russian Tokenizer
- added trailing space flags for tokens
2017-10-16 13:37:05 +07:00
Paul O'Leary McCann
71ae8013ec [ja] Use user_details instead of a wrapper class
Instead of using a JapaneseDoc wrapper class to store Mecab output,
stash it in `user_data`. -POLM
2017-10-16 00:24:34 +09:00
Paul O'Leary McCann
43eedf73f2 [ja] Stash tokenizer output for speed
Before this commit, the Mecab tokenizer had to be called twice when
creating a Doc- once during tokenization and once during tagging. This
creates a JapaneseDoc wrapper class for Doc that stashes the parsed
tokenizer output to remove redundant processing. -POLM
2017-10-15 23:33:25 +09:00
Ines Montani
e00a6c08cf Merge pull request #1418 from polm/master
Contributor agreement
2017-10-14 15:10:58 +02:00
Paul O'Leary McCann
a31d33be06 Contributor agreement 2017-10-14 19:28:04 +09:00
Ines Montani
4b5af8bd17 Merge pull request #1414 from yuukos/master
Adding Russian language support
2017-10-13 17:03:52 +02:00
Alex
95836abee1 Update CONTRIBUTORS.md 2017-10-13 21:02:19 +07:00
Alex
ce00405afc Create yuukos.md 2017-10-13 21:00:15 +07:00
yuukos
6fb9d75bd2 fixed test with creating tokenizer 2017-10-13 15:51:03 +07:00
yuukos
a229b6e0de added tests for Russian language
added tests of creating Russian Language instance and Russian tokenizer
2017-10-13 14:04:37 +07:00
yuukos
622b6d6270 updated Russian tokenizer
moved the trying to import pymorph into __init__
2017-10-13 13:57:29 +07:00
yuukos
f81dd284eb updated spacy/__init__.py
registered russian language via set_lang_class
2017-10-12 22:28:34 +07:00
yuukos
7b9491679f added russian language support 2017-10-12 22:24:20 +07:00
yuukos
2a78f4d634 updated .gitignore file
added excluding PyCharm's idea directory
2017-10-12 22:23:19 +07:00
Ines Montani
a06b84e7cc Merge pull request #1407 from hscspring/patch-6
Update training.jade
2017-10-11 14:25:38 +02:00
Ines Montani
ffc2fef13c Merge pull request #1411 from raphael0202/issue_1078
Resolve issue #1078 by simplifying URL pattern
2017-10-11 11:54:57 +02:00
Raphaël Bournhonesque
3452d6ce52 Resolve issue #1078 by simplifying URL pattern
- avoid catastrophic backtracking
- reduce character range of host name, domain name and TLD identifier
2017-10-11 11:24:00 +02:00
Yam
efe0800f91 Update training.jade
fix several changes
2017-10-09 21:39:15 -05:00