Commit Graph

13220 Commits

Author SHA1 Message Date
Matthew Honnibal
3b17eb7c49 Merge branch 'master' of https://github.com/explosion/spaCy 2017-12-07 10:39:32 +01:00
Matthew Honnibal
a6b43729c6 Set version to v2.0.5 2017-12-07 10:39:14 +01:00
ines
5eaa61c2b8 Fix formatting 2017-12-07 10:23:09 +01:00
ines
24e80c51b8 Document init-model command 2017-12-07 10:14:37 +01:00
Matthew Honnibal
c91f451b0f Fix imports and CLI in init-model 2017-12-07 10:03:07 +01:00
ines
82e80ff928 Rename model command to init_model and fix formatting 2017-12-07 09:59:23 +01:00
Ines Montani
2feeb428d6
Merge pull request #1646 from GreenRiverRUS/master
Added model command to create models from raw data
2017-12-07 08:54:26 +00:00
Matthew Honnibal
6373d2580d Increment version to v2.0.5.dev0 2017-12-07 09:53:59 +01:00
Matthew Honnibal
36b47e3fa6 Fix (and test) vector pickling 2017-12-07 09:53:30 +01:00
Ines Montani
2ae4755def
Merge pull request #1689 from mpuels/patch-3
doc: Replace 'is not' with '!=' in code example
2017-12-07 06:10:28 +00:00
mpuels
e3af19a076
doc: Replace 'is not' with '!=' in code example
The function `dependency_labels_to_root(token)` defined in section *Get syntactic dependencies* does not terminate. Here is a complete example:

    import spacy
    
    nlp = spacy.load('en')
    doc = nlp("Apple and banana are similar. Pasta and hippo aren't.")
    
    def dependency_labels_to_root(token):
        """Walk up the syntactic tree, collecting the arc labels."""
        dep_labels = []
        while token.head is not token:
            dep_labels.append(token.dep)
            token = token.head
        return dep_labels
    
    dep_labels = dependency_labels_to_root(doc[1])
    dep_labels

Replacing `is not` with `!=` solves the issue:

    import spacy
    
    nlp = spacy.load('en')
    doc = nlp("Apple and banana are similar. Pasta and hippo aren't.")
    
    def dependency_labels_to_root(token):
        """Walk up the syntactic tree, collecting the arc labels."""
        dep_labels = []
        while token.head != token:
            dep_labels.append(token.dep)
            token = token.head
        return dep_labels
    
    dep_labels = dependency_labels_to_root(doc[1])
    dep_labels

The output is

    ['cc', 'nsubj']
2017-12-06 20:08:42 +01:00
Ines Montani
050a94867b
Merge pull request #1688 from mpuels/patch-2
doc: Fix assert statement in Lightning Tour
2017-12-06 16:17:43 +00:00
mpuels
82e575ebfb
doc: Fix assert statement in Lightning Tour
Python 3 throws an error message on the original assert statement. Also, according to the Python documentation regarding the assert statement (https://docs.python.org/3/reference/simple_stmts.html#the-assert-statement), `assert` takes at least one argument and at most two. In the two-argument form the second argument is meant as an error message to be displayed when the assertion fails. I don't think this is intended in this case.
2017-12-06 16:40:51 +01:00
Ines Montani
798c4c14a7
Merge pull request #1687 from mpuels/patch-1
doc: Add missing *-operator to nlp.disable_pipes() in Lightning Tour
2017-12-06 14:36:29 +00:00
mpuels
662601f01c
doc: Add missing *-operator to nlp.disable_pipes()
I'm using SpaCy version 2.0.3. If I don't use the *-operator in the example, Python throws an error message. With the operator it works fine. Also according to the documentation of the function `nlp.disable_pipes()`, it expects one or more strings as arguments and not one argument being a list of strings.
2017-12-06 15:26:43 +01:00
ines
b078e276e6 Document offsets_from_biluo_tags 2017-12-06 13:40:51 +01:00
ines
fb663f9b7d Add Russian to list of languages 2017-12-06 13:40:32 +01:00
Matthew Honnibal
05f41ff587 Set version to 2.0.4 2017-12-06 13:24:02 +01:00
Matthew Honnibal
2b2ce125d5 Fix thinc version pin 2017-12-06 13:23:35 +01:00
Matthew Honnibal
04c38f7e87 Merge branch 'master' of https://github.com/explosion/spaCy 2017-12-06 12:15:52 +01:00
Matthew Honnibal
361944e512 If no rules are set, lemmatize by lookup 2017-12-06 12:12:11 +01:00
Matthew Honnibal
2ab0f2d186
Merge pull request #1664 from jimregan/italian-lemmatizer
BOM in Italian lemmatiser
2017-12-06 11:09:04 +01:00
Matthew Honnibal
3f247119d3
Merge pull request #1668 from sorenlind/da_morph
Add more Danish morph rules and clean up existing ones
2017-12-06 11:08:09 +01:00
Matthew Honnibal
04a92bd75e Pin msgpack-numpy requirement 2017-12-06 03:24:24 +01:00
Matthew Honnibal
b2f1cf8775 Try to fix travis locale problem 2017-12-06 01:50:03 +01:00
Matthew Honnibal
b051c11da7 Try to fix travis locale problem 2017-12-06 01:45:22 +01:00
Matthew Honnibal
8d27f091c8 Merge branch 'master' of https://github.com/explosion/spaCy 2017-12-05 17:01:35 +01:00
Matthew Honnibal
beb5ad24c5 Update travis 2017-12-05 17:01:14 +01:00
ines
58a19518cf Merge branch 'master' of https://github.com/explosion/spaCy 2017-12-05 13:17:58 +01:00
ines
7ade336ab7 Add "Unknown locale" issue to troubleshooting guide (see #1684, #1641, #1517) 2017-12-05 13:17:55 +01:00
Matthew Honnibal
b712de774e Fix vectors pickling 2017-12-05 12:45:24 +01:00
Matthew Honnibal
a59518df4f Merge branch 'master' of https://github.com/explosion/spaCy 2017-12-05 11:33:20 +01:00
Matthew Honnibal
5719fd1ce9 Require dev1 of thinc 2017-12-05 10:52:48 +01:00
Matthew Honnibal
04650e38c7 Set version to 2.0.4.dev0 2017-12-05 10:52:31 +01:00
Ines Montani
6c95187915
Merge pull request #1683 from mkdynamic/patch-1
Fix link to CLEAR Style dependency labels PDF
2017-12-05 08:57:38 +00:00
Mark Dodwell
9d4c185860
Fix link to CLEAR Style dependency labels PDF 2017-12-04 23:28:06 -08:00
Matthew Honnibal
07acb43a85 Merge branch 'master' of https://github.com/explosion/spaCy 2017-12-04 14:42:52 +01:00
Matthew Honnibal
f5cae1f598
Merge pull request #1674 from twerkmeister/patch-1
fix setup.py spacy req string for packaging
2017-12-04 14:04:09 +01:00
Thomas Werkmeister
94eac75b7c
fix setup.py spacy req string for packaging
Requirement should be `spacy>=2.0.2` instead of `spacy2.0.2`
2017-12-03 04:16:28 -06:00
ines
40638b7cdf Update resources 2017-12-02 04:16:03 +01:00
ines
f2ea6d4713 Add Dutch example sentences (see #1107) 2017-12-01 23:36:05 +01:00
ines
9ea8a7cf0c Add spacy_cld to extensions 2017-12-01 23:21:33 +01:00
Canbey Bilgili
86ac8ea5ba Adds Canbey Bilgili's Contributor Agreement 2017-12-01 17:27:41 +03:00
Canbey Bilgili
abe098b255 Adds Turkish Lemmatization 2017-12-01 17:04:32 +03:00
Ines Montani
a07b44fb47
Merge pull request #1667 from GreenRiverRUS/russian_bugfixies
Bugfixies in russian support
2017-11-30 20:37:43 +00:00
Søren Lind Kristiansen
d86b537a38 Enable morph rules for Danish 2017-11-30 15:58:02 +01:00
Søren Lind Kristiansen
13a988adc3 Remove 'Number[psor]' 2017-11-30 15:55:04 +01:00
Søren Lind Kristiansen
dd6fde18a9 Add more Danish morph rules and clean up existing ones 2017-11-30 11:17:19 +01:00
Vadim Mazaev
495eacf470 Merge branch 'model_command' 2017-11-30 12:30:26 +03:00
Vadim Mazaev
4ba7ddf651 Bugfixies 2017-11-30 12:29:38 +03:00