7924 Commits

Author SHA1 Message Date
ines
22dc744b48 Fix check for '@' in like_url (see #1715) 2017-12-16 13:48:43 +01:00
ines
21482b391b Fix head 2017-12-16 13:48:19 +01:00
Ines Montani
7a6f24a194
Merge pull request #1720 from mpuels/patch-6
doc: Fix minor mistakes
2017-12-13 11:11:59 +00:00
Ines Montani
aad26965bd
Merge pull request #1719 from mpuels/patch-5
fix: Add missing period in train data
2017-12-13 11:10:57 +00:00
mpuels
3f7bedadee
doc: Fix minor mistakes 2017-12-13 11:37:24 +01:00
mpuels
1e8147aec7
fix: Add missing period in train data 2017-12-13 10:51:05 +01:00
Ines Montani
1e61fffd0a
Merge pull request #1715 from Bri-Will/master (resolves #1698)
Update lex_attrs.py. Fix like_url from matching on e-mail
2017-12-12 10:50:10 +00:00
Ines Montani
9c1ee65268
Add regression test for #1698 2017-12-12 10:36:11 +01:00
Ines Montani
6455b574fc
Check for email address first 2017-12-12 10:25:13 +01:00
Bri-Will
afd9fc9d36
Adds contributor agreement for Bri-Will 2017-12-11 14:38:37 -08:00
Bri-Will
d77361d76c
Update lex_attrs.py. Fix like_url from matching on e-mail 2017-12-11 14:13:28 -08:00
Ines Montani
08e2c77368
Merge pull request #1710 from sorenlind/init_model_plac
Remove abbreviation for positional plac argument
2017-12-11 15:15:11 +00:00
Søren Lind Kristiansen
5a9d377580 Remove abbreviation for positional plac argument 2017-12-11 11:08:29 +01:00
Ines Montani
9b25605c3b
Merge pull request #1708 from IsaacHaze/issue_1622 (fixes #1622)
Fix Issue 1622
2017-12-11 01:23:59 +00:00
Isaac Sijaranamual
38021fbb00 Switch from python 3 only TemporaryDirectory to pytest's tmpdir 2017-12-11 00:16:04 +01:00
Isaac Sijaranamual
f32c6630cb Adds contributor agreement IsaacHaze 2017-12-10 23:15:06 +01:00
Isaac Sijaranamual
20ae0c459a Fixes "Error saving model" #1622 2017-12-10 23:07:13 +01:00
Isaac Sijaranamual
568130ce7c Adds regression test_issue1622 2017-12-10 23:00:48 +01:00
Isaac Sijaranamual
e188b61960 Make cli/train.py not eat exception 2017-12-10 22:53:08 +01:00
ines
020a7e5d52 Allow 'fine_grained' option in displaCy (see #1703)
Shows token.tag_ instead of token.pos_. Disabled by default, to not cause rendering issues for models with long fine-grained tags (e.g. merged morphological features).
2017-12-09 15:11:12 +01:00
Ines Montani
d8dd484dc0
Merge pull request #1705 from mpuels/patch-4
Fix typo in comment
2017-12-09 14:02:50 +00:00
mpuels
ee4d6fdd40
Fix typo in comment 2017-12-09 13:14:57 +01:00
Ines Montani
51d3ab2137
Revert contributor agreement to empty form 2017-12-07 16:22:30 +01:00
Matthew Honnibal
3b17eb7c49 Merge branch 'master' of https://github.com/explosion/spaCy 2017-12-07 10:39:32 +01:00
Matthew Honnibal
a6b43729c6 Set version to v2.0.5 2017-12-07 10:39:14 +01:00
ines
5eaa61c2b8 Fix formatting 2017-12-07 10:23:09 +01:00
ines
24e80c51b8 Document init-model command 2017-12-07 10:14:37 +01:00
Matthew Honnibal
c91f451b0f Fix imports and CLI in init-model 2017-12-07 10:03:07 +01:00
ines
82e80ff928 Rename model command to init_model and fix formatting 2017-12-07 09:59:23 +01:00
Ines Montani
2feeb428d6
Merge pull request #1646 from GreenRiverRUS/master
Added model command to create models from raw data
2017-12-07 08:54:26 +00:00
Matthew Honnibal
6373d2580d Increment version to v2.0.5.dev0 2017-12-07 09:53:59 +01:00
Matthew Honnibal
36b47e3fa6 Fix (and test) vector pickling 2017-12-07 09:53:30 +01:00
Ines Montani
2ae4755def
Merge pull request #1689 from mpuels/patch-3
doc: Replace 'is not' with '!=' in code example
2017-12-07 06:10:28 +00:00
mpuels
e3af19a076
doc: Replace 'is not' with '!=' in code example
The function `dependency_labels_to_root(token)` defined in section *Get syntactic dependencies* does not terminate. Here is a complete example:

    import spacy
    
    nlp = spacy.load('en')
    doc = nlp("Apple and banana are similar. Pasta and hippo aren't.")
    
    def dependency_labels_to_root(token):
        """Walk up the syntactic tree, collecting the arc labels."""
        dep_labels = []
        while token.head is not token:
            dep_labels.append(token.dep_)
            token = token.head
        return dep_labels
    
    dep_labels = dependency_labels_to_root(doc[1])
    dep_labels

Replacing `is not` with `!=` solves the issue:

    import spacy
    
    nlp = spacy.load('en')
    doc = nlp("Apple and banana are similar. Pasta and hippo aren't.")
    
    def dependency_labels_to_root(token):
        """Walk up the syntactic tree, collecting the arc labels."""
        dep_labels = []
        while token.head != token:
            dep_labels.append(token.dep_)
            token = token.head
        return dep_labels
    
    dep_labels = dependency_labels_to_root(doc[1])
    dep_labels

The output is

    ['cc', 'nsubj']
2017-12-06 20:08:42 +01:00
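Note on commit e3af19a076: the message does not say why `is not` fails to terminate. One plausible explanation, assumed here rather than taken from the commit, is that each access to `token.head` builds a new Python object, so an identity check never succeeds at the root while a value comparison does. The sketch below reproduces that behaviour with a hypothetical `Token` stand-in (not spaCy's class) over a toy head array.

```python
class Token:
    """Hypothetical stand-in for a token view; not spaCy's actual class."""

    def __init__(self, heads, i):
        self.heads = heads  # heads[i] is the index of token i's head
        self.i = i

    @property
    def head(self):
        # Assumption: a fresh wrapper object is built on every access.
        return Token(self.heads, self.heads[self.i])

    def __eq__(self, other):
        # Two wrappers are equal when they point at the same position.
        return isinstance(other, Token) and self.i == other.i

    def __hash__(self):
        return hash(self.i)


def steps_to_root(token, limit=100):
    """Count arcs up to the root, comparing by value (!=)."""
    steps = 0
    while token.head != token and steps < limit:
        token = token.head
        steps += 1
    return steps


# Toy tree: token 1 -> token 0 -> token 3; token 3 is its own head (root).
heads = [3, 0, 0, 3]
root = Token(heads, 3)

print(root.head == root)   # value comparison recognises the root
print(root.head is root)   # identity comparison does not
print(steps_to_root(Token(heads, 1)))
```

With this stand-in, a loop guarded by `token.head is not token` would spin forever at the root, because the two sides are always distinct objects.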
Ines Montani
050a94867b
Merge pull request #1688 from mpuels/patch-2
doc: Fix assert statement in Lightning Tour
2017-12-06 16:17:43 +00:00
mpuels
82e575ebfb
doc: Fix assert statement in Lightning Tour
Python 3 raises an error on the original assert statement. Also, according to the Python documentation on the assert statement (https://docs.python.org/3/reference/simple_stmts.html#the-assert-statement), `assert` takes at least one argument and at most two. In the two-argument form, the second argument is the error message displayed when the assertion fails, which I don't think is what was intended here.
2017-12-06 16:40:51 +01:00
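Note on commit 82e575ebfb: the original assert statement from the Lightning Tour is not shown in this log, so the following is only a generic illustration of the two-argument pitfall the message describes. The variable names and values are made up for the example.

```python
text, tag = "An", "NN"

# Two-argument form: the expression after the comma is only the failure
# message, so it is never checked. This passes even though tag != "VB".
assert text == "An", tag == "VB"

# One way to check both values: compare them together as tuples.
assert (text, tag) == ("An", "NN")


def message_on_failure():
    """Show that the second argument surfaces as the error message."""
    try:
        assert text == "A", "unexpected first token"
    except AssertionError as err:
        return str(err)
    return None


print(message_on_failure())
```

Comparing tuples is just one option; two separate assert statements would work equally well and give more specific failures.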
Ines Montani
798c4c14a7
Merge pull request #1687 from mpuels/patch-1
doc: Add missing *-operator to nlp.disable_pipes() in Lightning Tour
2017-12-06 14:36:29 +00:00
mpuels
662601f01c
doc: Add missing *-operator to nlp.disable_pipes()
I'm using spaCy version 2.0.3. Without the *-operator in the example, Python raises an error; with the operator it works fine. Also, according to the documentation of `nlp.disable_pipes()`, the function expects one or more strings as arguments, not a single argument that is a list of strings.
2017-12-06 15:26:43 +01:00
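Note on commit 662601f01c: the difference between passing a list and unpacking it can be sketched without spaCy. The `disable_pipes` function below is a hypothetical stand-in that only mimics a variadic signature taking strings; the real method's behaviour and error type may differ.

```python
def disable_pipes(*names):
    """Hypothetical stand-in: accepts one or more pipe names as strings."""
    for name in names:
        if not isinstance(name, str):
            raise TypeError("expected a string, got %s" % type(name).__name__)
    return list(names)


pipes = ["tagger", "parser"]

# Unpacked with *: the function receives two string arguments.
disabled = disable_pipes(*pipes)

# Without *: the function receives a single argument that is a list.
try:
    disable_pipes(pipes)
    list_accepted = True
except TypeError:
    list_accepted = False

print(disabled, list_accepted)
```

The `*` operator spreads the list into separate positional arguments, which is what a `*names` signature expects.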
ines
b078e276e6 Document offsets_from_biluo_tags 2017-12-06 13:40:51 +01:00
ines
fb663f9b7d Add Russian to list of languages 2017-12-06 13:40:32 +01:00
Matthew Honnibal
05f41ff587 Set version to 2.0.4 2017-12-06 13:24:02 +01:00
Matthew Honnibal
2b2ce125d5 Fix thinc version pin 2017-12-06 13:23:35 +01:00
Matthew Honnibal
04c38f7e87 Merge branch 'master' of https://github.com/explosion/spaCy 2017-12-06 12:15:52 +01:00
Matthew Honnibal
361944e512 If no rules are set, lemmatize by lookup 2017-12-06 12:12:11 +01:00
Matthew Honnibal
2ab0f2d186
Merge pull request #1664 from jimregan/italian-lemmatizer
BOM in Italian lemmatiser
2017-12-06 11:09:04 +01:00
Matthew Honnibal
3f247119d3
Merge pull request #1668 from sorenlind/da_morph
Add more Danish morph rules and clean up existing ones
2017-12-06 11:08:09 +01:00
Matthew Honnibal
04a92bd75e Pin msgpack-numpy requirement 2017-12-06 03:24:24 +01:00
Matthew Honnibal
b2f1cf8775 Try to fix travis locale problem 2017-12-06 01:50:03 +01:00
Matthew Honnibal
b051c11da7 Try to fix travis locale problem 2017-12-06 01:45:22 +01:00
Matthew Honnibal
8d27f091c8 Merge branch 'master' of https://github.com/explosion/spaCy 2017-12-05 17:01:35 +01:00