Gyorgy Orosz
|
d9c59c4751
|
Maintaining backward compatibility.
|
2016-12-21 23:30:49 +01:00 |
|
Gyorgy Orosz
|
1748549aeb
|
Added exception pattern mechanism to the tokenizer.
|
2016-12-21 23:16:19 +01:00 |
|
Gyorgy Orosz
|
35aa54765d
|
Hungarian module is exposed in spacy.
|
2016-12-21 20:45:36 +01:00 |
|
Gyorgy Orosz
|
ab2f6ea46c
|
Removed data files from tests..
|
2016-12-21 20:22:09 +01:00 |
|
Ines Montani
|
3c87c71d43
|
Add tokenizer exceptions for a.m. and p.m. in Spanish
|
2016-12-21 18:19:10 +01:00 |
|
Ines Montani
|
d1a2846750
|
Document DET_LEMMA
|
2016-12-21 18:18:35 +01:00 |
|
Ines Montani
|
78e63dc7d0
|
Update tokenizer exceptions for English
|
2016-12-21 18:06:34 +01:00 |
|
Ines Montani
|
702d1eed93
|
Update tokenizer exceptions for German
|
2016-12-21 18:06:27 +01:00 |
|
Ines Montani
|
d60380418e
|
Update tokenizer exceptions for Spanish
|
2016-12-21 18:06:17 +01:00 |
|
Ines Montani
|
920fa0fed2
|
Add DET_LEMMA constant
|
2016-12-21 18:05:41 +01:00 |
|
Ines Montani
|
8978806ea6
|
Allow Vocab to load without serializer_freqs
|
2016-12-21 18:05:23 +01:00 |
|
Ines Montani
|
be8ed811f6
|
Remove trailing whitespace
|
2016-12-21 18:04:41 +01:00 |
|
Ines Montani
|
926e19184a
|
Merge pull request #695 from magnusburton/master
Added Swedish morph rules
|
2016-12-21 01:06:00 +01:00 |
|
Ines Montani
|
71c00db8a5
|
Update language models page
|
2016-12-21 00:54:54 +01:00 |
|
Gyorgy Orosz
|
3d5306acb9
|
Added further testcases.
|
2016-12-20 23:49:35 +01:00 |
|
Gyorgy Orosz
|
23956e72ff
|
Improved partial support for tokenzing Hungarian numbers
|
2016-12-20 23:36:59 +01:00 |
|
Matthew Honnibal
|
5a319060b9
|
Merge branch 'master' of https://github.com/explosion/spaCy
|
2016-12-20 16:26:57 -06:00 |
|
Matthew Honnibal
|
7793e2ad82
|
Fix use of dropout in sentiment analysis LSTM example
|
2016-12-20 16:26:38 -06:00 |
|
Gyorgy Orosz
|
6add156075
|
Refactored language data structure
|
2016-12-20 22:28:20 +01:00 |
|
Matthew Honnibal
|
6aed94a3b9
|
Merge pull request #698 from aikramer2/master
update to training doc
|
2016-12-21 07:46:51 +11:00 |
|
aikramer2
|
349143faa2
|
update to training doc
|
2016-12-20 12:01:16 -08:00 |
|
Gyorgy Orosz
|
366b3f8685
|
Merge branch 'master' into hu_tokenizer
|
2016-12-20 20:53:31 +01:00 |
|
Gyorgy Orosz
|
c035928156
|
Partial Hungarian number tokenization is added.
|
2016-12-20 20:46:20 +01:00 |
|
Matthew Honnibal
|
f3c61ae246
|
Merge pull request #697 from jmizgajski/patch-1
Fixed missing vec_path declaration that was failing if 'add_vectors' …
|
2016-12-21 04:32:43 +11:00 |
|
JM
|
70ff0639b5
|
Fixed missing vec_path declaration that was failing if 'add_vectors' was set
Added vec_path variable declaration to avoid accessing it before assignment in case 'add_vectors' is in overrides.
|
2016-12-20 18:21:05 +01:00 |
|
Magnus Burton
|
48dcc9f647
|
Added morph rules
|
2016-12-20 13:18:41 +01:00 |
|
Ines Montani
|
9a0748b12d
|
Update CONTRIBUTORS.md
|
2016-12-20 12:01:33 +01:00 |
|
Ines Montani
|
68abc16c9a
|
Merge pull request #694 from magnusburton/master
Initial commit for Swedish
|
2016-12-20 12:00:55 +01:00 |
|
Magnus Burton
|
db5a077d2b
|
Initial commit for Swedish
|
2016-12-20 11:05:06 +01:00 |
|
Ines Montani
|
a2525c76ee
|
Reformat word frequencies section in "adding languages" workflow
|
2016-12-19 17:18:38 +01:00 |
|
Ines Montani
|
ddf5c5bb61
|
Generalise dependency parsing annotation specs beyond English (closes #657)
|
2016-12-19 13:42:44 +01:00 |
|
Ines Montani
|
6a793251c8
|
Add aside on spaCy's custom pronoun lemma
|
2016-12-19 13:41:47 +01:00 |
|
Ines Montani
|
d0c15730c4
|
Fix link
|
2016-12-19 13:09:45 +01:00 |
|
Ines Montani
|
a9c0e77b80
|
Fix typo
|
2016-12-19 13:09:45 +01:00 |
|
Matthew Honnibal
|
0c7720e162
|
Remove unit and integration test packages
|
2016-12-19 00:26:56 +01:00 |
|
Matthew Honnibal
|
6c0c43c267
|
Add comment
|
2016-12-19 00:20:16 +01:00 |
|
Matthew Honnibal
|
6c7ae55084
|
Merge branch 'master' of ssh://github.com/explosion/spaCy
|
2016-12-19 00:15:26 +01:00 |
|
Matthew Honnibal
|
b2cebdcca7
|
List more test packages in the setup.py
|
2016-12-19 00:15:11 +01:00 |
|
Matthew Honnibal
|
97521c95b3
|
List the language_data package in the setup.py
|
2016-12-19 00:14:09 +01:00 |
|
Ines Montani
|
c773258ebf
|
Update README.rst
|
2016-12-18 23:54:39 +01:00 |
|
Ines Montani
|
fa65c6b54c
|
Add "Adding languages" workflow (closes #562)
|
2016-12-18 23:54:19 +01:00 |
|
Ines Montani
|
1cddb7da36
|
Add "Part-of-speech tagging" workflow (closes #581)
|
2016-12-18 23:54:19 +01:00 |
|
Ines Montani
|
89398ca57b
|
Bump version
|
2016-12-18 23:54:19 +01:00 |
|
Ines Montani
|
ac597b58f6
|
Update showcase
|
2016-12-18 23:54:18 +01:00 |
|
Matthew Honnibal
|
3f5747a9b2
|
Merge branch 'master' of ssh://github.com/explosion/spaCy
|
2016-12-18 23:44:22 +01:00 |
|
Matthew Honnibal
|
40e71586d6
|
Fix Issue #683: Add 'SP' to tag_map, if it's not there already, within the Morphology class.
|
2016-12-18 23:44:05 +01:00 |
|
Matthew Honnibal
|
fa1d23e10d
|
Merge branch 'master' of https://github.com/explosion/spaCy
|
2016-12-18 23:32:03 +01:00 |
|
Matthew Honnibal
|
f38eb25fe1
|
Fix test for word vector
|
2016-12-18 23:31:55 +01:00 |
|
Matthew Honnibal
|
4e68abebc4
|
Merge branch 'master' of ssh://github.com/explosion/spaCy
|
2016-12-18 23:19:45 +01:00 |
|
Matthew Honnibal
|
5a6328a5a4
|
Increment version
|
2016-12-18 23:19:19 +01:00 |
|