svlandeg
d3a8321172
fix typos
2020-08-19 15:12:12 +02:00
Ines Montani
225f8866a1
Fix consistency
2020-08-19 12:47:57 +02:00
Ines Montani
9c25656ccc
Update docs [ci skip]
2020-08-19 12:14:41 +02:00
Ines Montani
2285e59765
Merge pull request #5933 from svlandeg/feature/more-v3-docs [ci skip]
2020-08-19 11:29:02 +02:00
Ines Montani
13291e97ba
Update docs [ci skip]
2020-08-19 00:28:37 +02:00
svlandeg
6ed67d495a
format
2020-08-18 19:43:20 +02:00
svlandeg
f9fe5eb323
clean up example
2020-08-18 19:35:23 +02:00
svlandeg
a8acedd4ba
example of custom reader and batcher
2020-08-18 19:15:16 +02:00
svlandeg
abba639565
Merge remote-tracking branch 'upstream/develop' into feature/more-v3-docs
2020-08-18 18:55:12 +02:00
Ines Montani
82f0e20318
Update docs and consistency [ci skip]
2020-08-18 14:39:40 +02:00
Matthew Honnibal
b72bd1767f
Remove todo
2020-08-18 13:52:22 +02:00
Matthew Honnibal
574fd53289
Add precision/recall description
2020-08-18 13:51:08 +02:00
Matthew Honnibal
96a9c65f97
Add model architectures intro
2020-08-18 13:50:55 +02:00
svlandeg
f7b76d2d83
Merge remote-tracking branch 'upstream/develop' into feature/more-v3-docs
2020-08-18 11:57:52 +02:00
svlandeg
8dcda351ec
typo's and quick note on default values
2020-08-18 10:23:27 +02:00
Ines Montani
ef6cf3b276
Update docs [ci skip]
2020-08-18 01:29:34 +02:00
Ines Montani
728fec0194
Update docs [ci skip]
2020-08-18 00:49:19 +02:00
Ines Montani
9299166c75
Merge pull request #5925 from explosion/docs/vectors [ci skip]
...
Update the 'vectors' docs page
2020-08-17 21:45:09 +02:00
svlandeg
4fe4bab1c9
typo fixes
2020-08-17 17:10:15 +02:00
svlandeg
da80c18660
merge develop into branch
2020-08-17 16:57:18 +02:00
Ines Montani
3ae5e02f4f
Update docs, types and API consistency
2020-08-17 16:45:24 +02:00
Matthew Honnibal
052d82aa4e
Suggest vectors changes
2020-08-17 15:32:30 +02:00
svlandeg
961e818be6
p/r definitions
2020-08-17 15:02:39 +02:00
svlandeg
319692aa53
fix typos
2020-08-17 14:05:48 +02:00
Matthew Honnibal
be07567ac6
Update transformers page
2020-08-16 20:29:50 +02:00
Matthew Honnibal
8e5f99ee25
Update transformer docs intro. Also write system requirements
2020-08-16 20:13:24 +02:00
Ines Montani
a570c304df
Update quickstart, template and docs
2020-08-15 14:50:29 +02:00
Ines Montani
950832f087
Tidy up pipes ( #5906 )
...
* Tidy up pipes
* Fix init, defaults and raise custom errors
* Update docs
* Update docs [ci skip]
* Apply suggestions from code review
Co-authored-by: Matthew Honnibal <honnibal+gh@gmail.com>
* Tidy up error handling and validation, fix consistency
* Simplify get_examples check
* Remove unused import [ci skip]
Co-authored-by: Matthew Honnibal <honnibal+gh@gmail.com>
2020-08-11 23:29:31 +02:00
Ines Montani
b7ec06e331
Update docs [ci skip]
2020-08-11 20:57:23 +02:00
Ines Montani
10f42e3a39
Update docs [ci skip]
2020-08-11 00:09:49 +02:00
Ines Montani
2778d04377
Update docs [ci skip]
2020-08-10 23:41:09 +02:00
Ines Montani
023ba7ae26
Update docs
2020-08-10 17:13:11 +02:00
Ines Montani
12052bd8f6
Update docs [ci skip]
2020-08-10 01:20:10 +02:00
Ines Montani
d611cbef43
Update docs [ci skip]
2020-08-10 00:42:26 +02:00
Ines Montani
c044460823
Update docs [ci skip]
2020-08-10 00:01:38 +02:00
Ines Montani
05dcab10aa
Fix typo
2020-08-09 22:34:03 +02:00
Ines Montani
8d2baa153d
Update tokenizer docs and add test
2020-08-09 15:24:01 +02:00
Ines Montani
3901b088ff
Update graphics and 101 [ci skip]
2020-08-07 17:14:13 +02:00
Ines Montani
5e1421e5a6
Update docs [ci skip]
2020-08-07 16:23:12 +02:00
Ines Montani
b7e34c1451
Update docs [ci skip]
2020-08-07 16:13:13 +02:00
Ines Montani
e829d3bf14
Update docs [ci skip]
2020-08-07 15:46:20 +02:00
svlandeg
824f4b2107
casing consistent
2020-08-06 23:20:13 +02:00
Ines Montani
e5995904d6
Update docs
2020-08-06 19:30:43 +02:00
Ines Montani
5d417d3b19
WIP: Update docs [ci skip]
2020-08-06 13:10:15 +02:00
Ines Montani
06e80d95cd
Sync develop with nightly docs state ( #5883 )
...
Co-authored-by: svlandeg <sofie.vanlandeghem@gmail.com>
2020-08-06 00:28:14 +02:00
Ines Montani
50311a4d37
Update docs [ci skip]
2020-08-05 20:29:53 +02:00
Ines Montani
cdec46493f
Update docs
2020-08-05 15:00:54 +02:00
Ines Montani
4c055f0aa7
Add init CLI and init config ( #5854 )
...
* Add init CLI and init config draft
* Improve config validation
* Auto-format
* Don't export anything in debug config
* Update docs
2020-08-02 15:18:30 +02:00
Ines Montani
b40f44419b
Simplify pipe analysis
...
- remove unused code
- don't print by default
- integrate attrs info into analysis output
2020-08-01 13:40:06 +02:00
Ines Montani
98c6a85c8b
Update docs [ci skip]
2020-07-31 18:55:38 +02:00
Ines Montani
e9e8fa2466
Update docs and types
2020-07-31 17:02:54 +02:00
Ines Montani
160f1a5f94
Update docs [ci skip]
2020-07-31 13:26:39 +02:00
Ines Montani
3449c45fd9
Update docs [ci skip]
2020-07-29 19:48:26 +02:00
Ines Montani
9c80cb673d
Update docs [ci skip]
2020-07-29 19:41:34 +02:00
Ines Montani
9f69afdd1e
Update docs [ci skip]
2020-07-29 19:09:44 +02:00
Ines Montani
7a21775cd0
Merge pull request #5834 from explosion/feature/vectors
2020-07-29 18:49:26 +02:00
Ines Montani
158d8c1e48
Update docs [ci skip]
2020-07-29 18:44:10 +02:00
Matthew Honnibal
f7adc9d3b7
Start rewriting vectors docs
2020-07-29 17:10:06 +02:00
Ines Montani
e0ffe36e79
Update docstrings, docs and types
2020-07-29 11:36:42 +02:00
Ines Montani
d8b519c23c
API docs, docstrings and argument consistency
2020-07-27 18:11:45 +02:00
Ines Montani
7dd53d0964
Fix typo [ci skip]
2020-07-27 00:34:00 +02:00
Ines Montani
7adbaf9a5b
Update docs [ci skip]
2020-07-27 00:29:45 +02:00
Matthew Honnibal
fb5dbe30b5
Trim training 101
2020-07-26 13:43:22 +02:00
Matthew Honnibal
e6a7deb7cc
Edits to the training 101 section
2020-07-26 13:42:08 +02:00
Ines Montani
c288dba8e7
Update docs [ci skip]
2020-07-25 18:51:12 +02:00
Li Zhe
a69eb445dc
fix the wrong hash url in adding-languages.md file ( #5810 )
...
* fix the wrong hash url in adding-languages.md file
change the #101 url hash path to #language-data
* filled in the spaCy Contributor Agreement
filled in the spaCy Contributor Agreement
2020-07-25 13:13:38 +02:00
Adriane Boyd
d3385f4be2
Add Morphology and MorphAnalysis to overview
2020-07-21 13:06:22 +02:00
Ines Montani
644074b954
Merge branch 'develop' into master-tmp
2020-07-20 14:58:04 +02:00
Adriane Boyd
39ebcd9ec9
Refactor Chinese tokenizer configuration ( #5736 )
...
* Refactor Chinese tokenizer configuration
Refactor `ChineseTokenizer` configuration so that it uses a single
`segmenter` setting to choose between character segmentation, jieba, and
pkuseg.
* replace `use_jieba`, `use_pkuseg`, `require_pkuseg` with the setting
`segmenter` with the supported values: `char`, `jieba`, `pkuseg`
* make the default segmenter plain character segmentation `char` (no
additional libraries required)
* Fix Chinese serialization test to use char default
* Warn if attempting to customize other segmenter
Add a warning if `Chinese.pkuseg_update_user_dict` is called when
another segmenter is selected.
2020-07-19 13:34:37 +02:00
Adriane Boyd
cd5af72c9a
Update pkuseg version ( #5774 )
...
* Update pkuseg version in Chinese tokenizer warnings
* Update pkuseg version in `Makefile`
* Remove warning about python3.8 wheels in docs
2020-07-19 11:09:49 +02:00
Ines Montani
872938ec76
Merge pull request #5747 from explosion/feature/refactor-config-args
2020-07-14 00:00:22 +02:00
Ines Montani
5f6f4ff594
Remove object subclassing
2020-07-12 14:03:23 +02:00
Ines Montani
3f948b9c74
Update docs
2020-07-12 12:32:28 +02:00
Ines Montani
7b5717cac3
Merge branch 'develop' into feature/refactor-config-args
2020-07-10 22:50:07 +02:00
Ines Montani
e6a6587a9a
Update projects.md [ci skip]
2020-07-10 22:41:27 +02:00
Ines Montani
f2cd982e7b
Update training.md
2020-07-10 22:34:27 +02:00
Ines Montani
52e9b5b472
Fix formatting
2020-07-09 23:25:58 +02:00
Ines Montani
28cdae898a
Update projects.md
2020-07-09 22:35:54 +02:00
Ines Montani
7bcf9f7cfb
Document new features
2020-07-09 21:10:36 +02:00
Ines Montani
ea01831f6a
Update projects docs etc.
2020-07-09 19:43:25 +02:00
Ines Montani
2298e129e6
Update example and training docs
2020-07-07 20:30:12 +02:00
svlandeg
2b60e894cb
fix component constructors, update, begin_training, reference to GoldParse
2020-07-07 19:17:19 +02:00
Ines Montani
bb3ee38cf9
Update WIP
2020-07-06 22:22:37 +02:00
Ines Montani
44790c1c32
Update docs and add keyword-only tag
2020-07-06 18:14:57 +02:00
Ines Montani
a35236e5f0
Update v3 docs WIP [ci skip]
2020-07-06 15:57:44 +02:00
Ines Montani
63247cbe87
Update v3 docs [ci skip]
2020-07-05 16:11:16 +02:00
Ines Montani
dc8c9d912f
Update docs [ci skip]
2020-07-04 16:47:24 +02:00
Ines Montani
1e0d54edd1
Update docs
2020-07-04 14:23:10 +02:00
Ines Montani
06f1ecb308
Update v3 docs
2020-07-03 16:48:21 +02:00
Ines Montani
b5268955d7
Update matcher usage examples [ci skip]
2020-07-02 15:39:45 +02:00
Ines Montani
fe4cfd0632
Start updating website for v3 [ci skip]
2020-07-01 21:26:39 +02:00
Ines Montani
26df4efa94
Add new in v3.0
2020-07-01 13:02:17 +02:00
Ines Montani
414dc7ace1
Merge branch 'spacy.io' into spacy.io-develop
2020-07-01 11:47:47 +02:00
Matthias Hertel
305221f3e5
Website: fixed the token span in the text about the rule-based matching example ( #5669 )
...
* fixed token span in pattern matcher example
* contributor agreement
2020-06-30 19:58:55 +02:00
Matthias Hertel
8b0f749606
Website: fixed the token span in the text about the rule-based matching example ( #5669 )
...
* fixed token span in pattern matcher example
* contributor agreement
2020-06-30 19:58:23 +02:00
Adriane Boyd
d777d9cc38
Extend v2.3 migration guide ( #5653 )
...
* Extend preloaded vocab section
* Add section on tag maps
2020-06-26 14:13:01 +02:00
Adriane Boyd
c4d0209472
Extend v2.3 migration guide ( #5653 )
...
* Extend preloaded vocab section
* Add section on tag maps
2020-06-26 14:12:29 +02:00
Adriane Boyd
a2660bd9c6
Fix backslashes in warnings config diff ( #5640 )
...
Fix backslashes in warnings config diff in v2.3 migration section.
2020-06-24 10:26:57 +02:00
Adriane Boyd
fd4287c178
Fix backslashes in warnings config diff ( #5640 )
...
Fix backslashes in warnings config diff in v2.3 migration section.
2020-06-24 10:26:12 +02:00
Adriane Boyd
4f73ced914
Extend what's new in v2.3 with vocab / is_oov ( #5635 )
2020-06-23 16:50:43 +02:00
Adriane Boyd
7ce451c211
Extend what's new in v2.3 with vocab / is_oov ( #5635 )
2020-06-23 16:48:59 +02:00
Adriane Boyd
fcdecefacf
Add warnings example in v2.3 migration guide ( #5627 )
2020-06-22 14:38:06 +02:00
Adriane Boyd
bc1cb30b21
Add warnings example in v2.3 migration guide ( #5627 )
2020-06-22 14:37:24 +02:00
Ines Montani
52728d8fa3
Merge branch 'develop' into master-tmp
2020-06-20 15:52:00 +02:00
Adriane Boyd
66889de166
Warning for sudachipy 0.4.5 ( #5611 )
2020-06-19 13:45:23 +02:00
Adriane Boyd
931d80de72
Warning for sudachipy 0.4.5 ( #5611 )
2020-06-19 12:43:41 +02:00
Ines Montani
6d712f3e06
Merge pull request #5599 from adrianeboyd/docs/v2.3.0-minor
2020-06-16 13:49:25 -07:00
Adriane Boyd
02369f91d3
Fix spacy convert argument
2020-06-16 20:41:17 +02:00
Adriane Boyd
f0fd77648f
Change example title to Dr.
...
Change example title to Dr. so the current model does exclude the title
in the initial example.
2020-06-16 20:36:21 +02:00
Adriane Boyd
a6abdfbc3c
Fix numpy.zeros() dtype for Doc.from_array
2020-06-16 20:35:45 +02:00
Adriane Boyd
9aff317ca7
Update POS in tagging example
2020-06-16 20:26:57 +02:00
Adriane Boyd
457babfa0c
Update alignment example for new gold.align
2020-06-16 20:22:03 +02:00
Ines Montani
44af53bdd9
Add pkuseg warnings and auto-format [ci skip]
2020-06-16 17:13:35 +02:00
Adriane Boyd
d5110ffbf2
Documentation updates for v2.3.0 ( #5593 )
...
* Update website models for v2.3.0
* Add docs for Chinese word segmentation
* Tighten up Chinese docs section
* Merge branch 'master' into docs/v2.3.0 [ci skip]
* Merge branch 'master' into docs/v2.3.0 [ci skip]
* Auto-format and update version
* Update matcher.md
* Update languages and sorting
* Typo in landing page
* Infobox about token_match behavior
* Add meta and basic docs for Japanese
* POS -> TAG in models table
* Add info about lookups for normalization
* Updates to API docs for v2.3
* Update adding norm exceptions for adding languages
* Add --omit-extra-lookups to CLI API docs
* Add initial draft of "What's New in v2.3"
* Add new in v2.3 tags to Chinese and Japanese sections
* Add tokenizer to migration section
* Add new in v2.3 flags to init-model
* Typo
* More what's new in v2.3
Co-authored-by: Ines Montani <ines@ines.io>
2020-06-16 15:37:35 +02:00
Ines Montani
810fce3bb1
Merge branch 'develop' into master-tmp
2020-06-03 14:36:59 +02:00
Ines Montani
262d306eaa
unicode -> str consistency
2020-05-24 17:23:00 +02:00
Ines Montani
5d3806e059
unicode -> str consistency
2020-05-24 17:20:58 +02:00
Jannis
aa53ce6996
Documentation Typo Fix ( #5492 )
...
* Fix typo
Change 'realize' to 'realise'
* Add contributer agreement
2020-05-22 19:50:26 +02:00
Adriane Boyd
e4a1b5dab1
Rename to url_match
...
Rename to `url_match` and update docs.
2020-05-22 12:41:03 +02:00
Adriane Boyd
730fa493a4
Merge remote-tracking branch 'upstream/master' into bugfix/revert-token-match
2020-05-22 12:18:00 +02:00
Ines Montani
24f72c669c
Merge branch 'develop' into master-tmp
2020-05-21 18:39:06 +02:00
Sofie Van Landeghem
0d94737857
Feature toggle_pipes ( #5378 )
...
* make disable_pipes deprecated in favour of the new toggle_pipes
* rewrite disable_pipes statements
* update documentation
* remove bin/wiki_entity_linking folder
* one more fix
* remove deprecated link to documentation
* few more doc fixes
* add note about name change to the docs
* restore original disable_pipes
* small fixes
* fix typo
* fix error number to W096
* rename to select_pipes
* also make changes to the documentation
Co-authored-by: Matthew Honnibal <honnibal+gh@gmail.com>
2020-05-18 22:27:10 +02:00
Ines Montani
f333c2a011
Merge pull request #5386 from svlandeg/fix/nel-docs
2020-05-10 12:00:09 +02:00
adrianeboyd
4a15b559ba
Clarify Token.pos as UPOS ( #5419 )
2020-05-08 10:36:25 +02:00
Adriane Boyd
792c8af8cf
Merge remote-tracking branch 'upstream/master' into bugfix/revert-token-match
2020-05-05 09:25:57 +02:00
svlandeg
ebaed7dcfa
Few more updates to the EL documentation
2020-04-30 10:17:06 +02:00
Sofie Van Landeghem
cfdaf99b80
Fix passing of component configuration ( #5374 )
...
* add kwargs to to_disk methods in docs - otherwise crashes on 'exclude' argument
* add fix and test for Issue 5137
2020-04-29 12:56:17 +02:00
Sofie Van Landeghem
f67343295d
Update NEL examples and documentation ( #5370 )
...
* simplify creation of KB by skipping dim reduction
* small fixes to train EL example script
* add KB creation and NEL training example scripts to example section
* update descriptions of example scripts in the documentation
* moving wiki_entity_linking folder from bin to projects
* remove test for wiki NEL functionality that is being moved
2020-04-29 12:53:53 +02:00
adrianeboyd
90ce34db42
Add cuda101 and cuda102 options to setup ( #5377 )
...
* Add cuda101 and cuda102 options to setup
* Update cudaNNN options in docs
2020-04-29 12:51:12 +02:00
Mike
481574cbc8
[minor doc change] embedding vis. link is broken in website/docs/usage/examples.md
( #5325 )
...
* The embedding vis. link is broken
The first link seems to be reasonable for now unless someone has an updated embedding vis they want to share?
* contributor agreement
* Update Mlawrence95.md
* Update website/docs/usage/examples.md
Co-Authored-By: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
2020-04-21 20:35:12 +02:00
Sofie Van Landeghem
1137420840
Small doc fixes ( #5250 )
...
* fix link
* torchtext instead tochtext
2020-04-03 13:01:43 +02:00
Sofie Van Landeghem
9b412516e7
Fixing pickling of the parser ( #5218 )
...
* fix __reduce__ for pickling parser
* setting the move object as 'state' during pickling
* unskip test_issue4725 - works again
2020-03-27 19:35:26 +01:00
Ines Montani
46568f40a7
Merge branch 'master' into tmp/sync
2020-03-26 13:38:14 +01:00
Tiljander
e53232533b
Describing priority rules for overlapping matches ( #5197 )
...
* Describing priority rules for overlapping matches
* Create Tiljander.md
* Describing priority rules for overlapping matches
* Update website/docs/api/entityruler.md
Co-Authored-By: Ines Montani <ines@ines.io>
Co-authored-by: Ines Montani <ines@ines.io>
2020-03-26 13:13:22 +01:00
adrianeboyd
d88a377bed
Remove Vectors.from_glove ( #5209 )
2020-03-26 10:45:47 +01:00
Ines Montani
17bd9ed84f
Merge pull request #5153 from pinealan/fix/website-docs
...
Fix website typos and weird sentences
2020-03-16 15:03:01 +01:00
Alan Chan
36e3532475
Remove unfinished sentence
2020-03-15 03:45:17 +08:00
Mark Abraham
a0ffa346c0
Fix broken link in docs
2020-03-13 14:07:26 +01:00
Renaud Richardet
eccf6b1686
small typo in code sample
2020-03-09 14:49:11 +01:00
Adriane Boyd
0c31f03ec5
Update docs [ci skip]
2020-03-09 13:41:17 +01:00
Adriane Boyd
1139247532
Revert changes to token_match priority from #4374
...
* Revert changes to priority of `token_match` so that it has priority
over all other tokenizer patterns
* Add lookahead and potentially slow lookbehind back to the default URL
pattern
* Expand character classes in URL pattern to improve matching around
lookaheads and lookbehinds related to #4882
* Revert changes to Hungarian tokenizer
* Revert (xfail) several URL tests to their status before #4374
* Update `tokenizer.explain()` and docs accordingly
2020-03-09 12:09:41 +01:00
Ines Montani
de11ea753a
Merge branch 'master' into develop
2020-02-18 14:47:23 +01:00
Kabir Khan
f6ed07b85c
Use nlp.pipe in EntityRuler for phrase patterns in add_patterns ( #4931 )
...
* Fix ent_ids and labels properties when id attribute used in patterns
* use set for labels
* sort end_ids for comparison in entity_ruler tests
* fixing entity_ruler ent_ids test
* add to set
* Run make_doc optimistically if using phrase matcher patterns.
* remove unused coveragerc I was testing with
* format
* Refactor EntityRuler.add_patterns to use nlp.pipe for phrase patterns. Improves speed substantially.
* Removing old add_patterns function
* Fixing spacing
* Make sure token_patterns loaded as well, before generator was being emptied in from_disk
2020-02-16 18:17:47 +01:00
Julin S
479e81bafc
fix link ( #4977 )
2020-02-10 20:31:26 -05:00
Ines Montani
9c08d9baa3
Remove old sections [ci skip] ( closes #4961 )
2020-02-03 13:10:46 +01:00
Preston Badeer
b216ff43c9
Update vectors-similarity.md ( #4889 )
...
These links are broken on the website, due to quotes around the URLs.
2020-01-08 16:49:40 +01:00
Geoffrey Gordon Ashbrook
53929138d7
remove extra word typo ( #4875 )
...
"let you find you"
2020-01-06 12:37:42 +01:00
Ines Montani
400257a802
Update index.md [ci skip]
2020-01-04 01:52:18 +01:00
Ines Montani
db55577c45
Drop Python 2.7 and 3.5 ( #4828 )
...
* Remove unicode declarations
* Remove Python 3.5 and 2.7 from CI
* Don't require pathlib
* Replace compat helpers
* Remove OrderedDict
* Use f-strings
* Set Cython compiler language level
* Fix typo
* Re-add OrderedDict for Table
* Update setup.cfg
* Revert CONTRIBUTING.md
* Revert lookups.md
* Revert top-level.md
* Small adjustments and docs [ci skip]
2019-12-22 01:53:56 +01:00
Ines Montani
158b98a3ef
Merge branch 'master' into develop
2019-12-21 18:55:03 +01:00