Commit Graph

1225 Commits

Author SHA1 Message Date
Ines Montani
96e4d196f5 Reset service worker changes and tidy up package.json 2019-02-26 12:41:18 +01:00
Ines Montani
164559c336 Move Dropdown to own component 2019-02-26 12:40:56 +01:00
Ines Montani
761431f5cb Move polyfill 2019-02-26 12:25:25 +01:00
Ines Montani
5fdce55a1b Add IntersectionObserver polyfill 2019-02-26 12:24:10 +01:00
Ines Montani
bb2b36f65a Improve navigation dropdown 2019-02-26 12:04:47 +01:00
Ines Montani
34bfe70518 Remove service worker 2019-02-26 10:56:15 +01:00
Ines Montani
da74b0bb3e Update gatsby and gatsby-plugin-offline
Trying to fix this: https://github.com/gatsbyjs/gatsby/issues/11524 (mobile only!)
2019-02-26 10:47:51 +01:00
Ines Montani
dbee31c17a Revert "Move DocSearch styles before headComponents"
This reverts commit 1232ccbc0f.
2019-02-26 09:46:07 +01:00
Ines Montani
1232ccbc0f Move DocSearch styles before headComponents 2019-02-25 21:39:10 +01:00
Ines Montani
c5dd450a75 Try and fix search responsiveness [ci skip] 2019-02-25 21:34:28 +01:00
Ines Montani
7d980391f4 Merge branch 'develop' into spacy.io 2019-02-25 20:29:29 +01:00
Ines Montani
3379ebcaa4 Fix default prop [ci skip] 2019-02-25 20:29:11 +01:00
Ines Montani
738426cccf Merge branch 'develop' into spacy.io 2019-02-25 20:22:55 +01:00
Ines Montani
e711969e3b Add more human-readable class names [ci skip] 2019-02-25 20:22:40 +01:00
Ines Montani
0a7a2c73e2 Merge branch 'develop' into spacy.io 2019-02-25 20:11:30 +01:00
Ines Montani
162bd4d75b
💫 Add Algolia DocSearch (#3332)
* Add Algolia DocSearch

* Add human-readable selector for teaser
2019-02-25 20:11:11 +01:00
Ines Montani
bee1966b88 Merge branch 'develop' into spacy.io 2019-02-25 10:03:57 +01:00
Ines Montani
1b6238101a Add table explaining training metrics [closes #2644] 2019-02-25 10:03:43 +01:00
Ines Montani
1981b194cc Fix recomputing of :target [ci skip]
Prevents additional history entry
2019-02-25 10:03:20 +01:00
Ines Montani
e983eefee7 Merge branch 'develop' into spacy.io 2019-02-24 22:22:30 +01:00
Ines Montani
d0b3af9222 Fix remaining inaccuracies in API docs (closes #2329) 2019-02-24 22:21:25 +01:00
Ines Montani
69cfd7d2ce Merge branch 'develop' into spacy.io 2019-02-24 22:02:00 +01:00
Ines Montani
49d0938038 Update version [ci skip] 2019-02-24 22:01:47 +01:00
Ines Montani
17038fe768 Merge branch 'develop' into spacy.io 2019-02-24 21:14:42 +01:00
Ines Montani
62b558ab72 💫 Support lexical attributes in retokenizer attrs (closes #2390) (#3325)
* Fix formatting and whitespace

* Add support for lexical attributes (closes #2390)

* Document lexical attribute setting during retokenization

* Assign variable oputside of nested loop
2019-02-24 21:13:51 +01:00
Ines Montani
41f86f640b Merge branch 'develop' into spacy.io 2019-02-24 18:45:55 +01:00
Ines Montani
aa52305461 Improve pipeline model and meta example [ci skip] 2019-02-24 18:45:39 +01:00
Ines Montani
df19e2bff6
💫 Allow setting of custom attributes during retokenization (closes #3314) (#3324)
<!--- Provide a general summary of your changes in the title. -->

## Description

This PR adds the abilility to override custom extension attributes during merging. This will only work for attributes that are writable, i.e. attributes registered with a default value like `default=False` or attribute that have both a getter *and* a setter implemented.

```python
Token.set_extension('is_musician', default=False)

doc = nlp("I like David Bowie.")
with doc.retokenize() as retokenizer:
    attrs = {"LEMMA": "David Bowie", "_": {"is_musician": True}}
    retokenizer.merge(doc[2:4], attrs=attrs)

assert doc[2].text == "David Bowie"
assert doc[2].lemma_ == "David Bowie"
assert doc[2]._.is_musician
```

### Types of change
enhancement

## Checklist
<!--- Before you submit the PR, go over this checklist and make sure you can
tick off all the boxes. [] -> [x] -->
- [x] I have submitted the spaCy Contributor Agreement.
- [x] I ran the tests, and all new and existing tests passed.
- [x] My changes don't require a change to the documentation, or if they do, I've added all required information.
2019-02-24 18:38:47 +01:00
Ines Montani
a6709a2f29 Merge branch 'develop' into spacy.io 2019-02-24 18:36:27 +01:00
Ines Montani
948ca2bb3e Merge branch 'develop' into spacy.io 2019-02-24 18:35:32 +01:00
Ines Montani
403b9cd58b Add docs on adding to existing tokenizer rules [ci skip] 2019-02-24 18:35:19 +01:00
Ines Montani
1ea1bc98e7 Document regex utilities [ci skip] 2019-02-24 18:34:10 +01:00
Ines Montani
876ef840c4 Merge branch 'develop' into spacy.io 2019-02-24 13:38:07 +01:00
Ines Montani
09bf08b3c3 Update redirects [ci skip] 2019-02-24 13:37:50 +01:00
Ines Montani
dceca3264d Tidy up package.json [ci skip] 2019-02-24 13:37:41 +01:00
Ines Montani
8458379cf5 Merge branch 'develop' into spacy.io 2019-02-24 13:12:26 +01:00
Ines Montani
46ec5cdccc Update TextCategorizer docs 2019-02-24 13:11:57 +01:00
Ines Montani
c03cb1cc63 Improve built-in component API docs 2019-02-24 13:11:49 +01:00
Ines Montani
f34d6281d6 Merge branch 'develop' into spacy.io 2019-02-24 12:08:15 +01:00
Ines Montani
383e2e1f12 Update Python versions [ci skip] 2019-02-24 11:49:45 +01:00
Ines Montani
b624cb4b89 Update v2-1.md 2019-02-24 11:49:27 +01:00
Ines Montani
250e88ef55 Fix docs example (see #2728) 2019-02-21 14:22:06 +01:00
Ines Montani
ab8392eda3 Merge branch 'develop' into spacy.io 2019-02-21 12:34:51 +01:00
Ines Montani
0fc908d7a5 Add note on merging speed in v2.1 (see #3300) [ci skip] 2019-02-21 12:34:18 +01:00
Ines Montani
236aa94ded Update v2-1.md 2019-02-21 12:33:56 +01:00
Sofie
9a478b6db8 Clean up of char classes, few tokenizer fixes and faster default French tokenizer (#3293)
* splitting up latin unicode interval

* removing hyphen as infix for French

* adding failing test for issue 1235

* test for issue #3002 which now works

* partial fix for issue #2070

* keep the hyphen as infix for French (as it was)

* restore french expressions with hyphen as infix (as it was)

* added succeeding unit test for Issue #2656

* Fix issue #2822 with custom Italian exception

* Fix issue #2926 by allowing numbers right before infix /

* splitting up latin unicode interval

* removing hyphen as infix for French

* adding failing test for issue 1235

* test for issue #3002 which now works

* partial fix for issue #2070

* keep the hyphen as infix for French (as it was)

* restore french expressions with hyphen as infix (as it was)

* added succeeding unit test for Issue #2656

* Fix issue #2822 with custom Italian exception

* Fix issue #2926 by allowing numbers right before infix /

* remove duplicate

* remove xfail for Issue #2179 fixed by Matt

* adjust documentation and remove reference to regex lib
2019-02-20 22:10:13 +01:00
Ines Montani
f73d01aa32 Update netlify.toml [ci skip] 2019-02-20 14:33:32 +01:00
Ines Montani
da5edbe434 Tidy up 2019-02-20 14:33:23 +01:00
Ines Montani
417e86a77f Merge branch 'develop' into spacy.io 2019-02-18 21:50:16 +01:00
Ines Montani
57ae71ea95 Add docs on serializing the pipeline (see #3289) [ci skip] 2019-02-18 14:13:29 +01:00