Commit Graph

4341 Commits

Author SHA1 Message Date
Ines Montani
f6f6e028ea Make links detect target automatically and replace false with null for no attribute
New version of Harp would render attribute=false as attribute="false",
while attribute=null renders element without attribute.
2016-12-24 12:24:04 +01:00
Ines Montani
b893126c12 Use link mixin instead of plain link markup 2016-12-24 12:22:52 +01:00
Ines Montani
8785706039 Reformat stop words for better readability 2016-12-24 00:58:40 +01:00
Gyorgy Orosz
45e045a87b Unicode/UTF8 compatibility for Python2 2016-12-24 00:21:00 +01:00
Gyorgy Orosz
72b61b6d03 Typo fix. 2016-12-24 00:10:29 +01:00
Gyorgy Orosz
3a9be4d485 Updated token exception handling mechanism to allow the usage of arbitrary functions as token exception matchers. 2016-12-23 23:49:34 +01:00
Ines Montani
207555fae7 Fix spelling 2016-12-23 21:36:01 +01:00
Ines Montani
1436b9f15a Fix formatting and consistency 2016-12-23 21:36:01 +01:00
Ines Montani
1d64527727 Update Spanish tokenizer
Remove reflexive pronouns as they're part of an open class, fix
mistakes and add exceptions
2016-12-23 21:36:01 +01:00
Ines Montani
12bb0aa3e3 Fix license formatting for GitHub's parser 2016-12-23 15:05:03 +01:00
Ines Montani
48b03b4001 Fix formatting and wording 2016-12-23 14:36:03 +01:00
Ines Montani
cc051ddc15 Add resources page to usage docs 2016-12-23 14:36:03 +01:00
Ines Montani
11ec02d5e3 Separate inline icon and help cursor classes 2016-12-23 14:36:03 +01:00
Ines Montani
7f411fd01c Remove exceptions containing whitespace / no special chars 2016-12-23 14:30:06 +01:00
Magnus Burton
fdf4776262 Added Swedish abbreviations 2016-12-22 22:45:18 +01:00
Ines Montani
642803d533 Merge pull request #702 from fnorf/patch-1
fixed minor typo
2016-12-22 13:06:56 +01:00
Hannes
c5c0ed9af8 fixed minor typo
Peformance -> Performance
2016-12-22 13:02:56 +01:00
Gyorgy Orosz
d9c59c4751 Maintaining backward compatibility. 2016-12-21 23:30:49 +01:00
Gyorgy Orosz
1748549aeb Added exception pattern mechanism to the tokenizer. 2016-12-21 23:16:19 +01:00
Gyorgy Orosz
35aa54765d Hungarian module is exposed in spacy. 2016-12-21 20:45:36 +01:00
Gyorgy Orosz
ab2f6ea46c Removed data files from tests.. 2016-12-21 20:22:09 +01:00
Ines Montani
3c87c71d43 Add tokenizer exceptions for a.m. and p.m. in Spanish 2016-12-21 18:19:10 +01:00
Ines Montani
d1a2846750 Document DET_LEMMA 2016-12-21 18:18:35 +01:00
Ines Montani
78e63dc7d0 Update tokenizer exceptions for English 2016-12-21 18:06:34 +01:00
Ines Montani
702d1eed93 Update tokenizer exceptions for German 2016-12-21 18:06:27 +01:00
Ines Montani
d60380418e Update tokenizer exceptions for Spanish 2016-12-21 18:06:17 +01:00
Ines Montani
920fa0fed2 Add DET_LEMMA constant 2016-12-21 18:05:41 +01:00
Ines Montani
8978806ea6 Allow Vocab to load without serializer_freqs 2016-12-21 18:05:23 +01:00
Ines Montani
be8ed811f6 Remove trailing whitespace 2016-12-21 18:04:41 +01:00
Ines Montani
926e19184a Merge pull request #695 from magnusburton/master
Added Swedish morph rules
2016-12-21 01:06:00 +01:00
Ines Montani
71c00db8a5 Update language models page 2016-12-21 00:54:54 +01:00
Gyorgy Orosz
3d5306acb9 Added further testcases. 2016-12-20 23:49:35 +01:00
Gyorgy Orosz
23956e72ff Improved partial support for tokenzing Hungarian numbers 2016-12-20 23:36:59 +01:00
Matthew Honnibal
5a319060b9 Merge branch 'master' of https://github.com/explosion/spaCy 2016-12-20 16:26:57 -06:00
Matthew Honnibal
7793e2ad82 Fix use of dropout in sentiment analysis LSTM example 2016-12-20 16:26:38 -06:00
Gyorgy Orosz
6add156075 Refactored language data structure 2016-12-20 22:28:20 +01:00
Matthew Honnibal
6aed94a3b9 Merge pull request #698 from aikramer2/master
update to training doc
2016-12-21 07:46:51 +11:00
aikramer2
349143faa2 update to training doc 2016-12-20 12:01:16 -08:00
Gyorgy Orosz
366b3f8685 Merge branch 'master' into hu_tokenizer 2016-12-20 20:53:31 +01:00
Gyorgy Orosz
c035928156 Partial Hungarian number tokenization is added. 2016-12-20 20:46:20 +01:00
Matthew Honnibal
f3c61ae246 Merge pull request #697 from jmizgajski/patch-1
Fixed missing vec_path declaration that was failing if 'add_vectors' …
2016-12-21 04:32:43 +11:00
JM
70ff0639b5 Fixed missing vec_path declaration that was failing if 'add_vectors' was set
Added vec_path variable declaration to avoid accessing it before assignment in case 'add_vectors' is in overrides.
2016-12-20 18:21:05 +01:00
Magnus Burton
48dcc9f647 Added morph rules 2016-12-20 13:18:41 +01:00
Ines Montani
9a0748b12d Update CONTRIBUTORS.md 2016-12-20 12:01:33 +01:00
Ines Montani
68abc16c9a Merge pull request #694 from magnusburton/master
Initial commit for Swedish
2016-12-20 12:00:55 +01:00
Magnus Burton
db5a077d2b Initial commit for Swedish 2016-12-20 11:05:06 +01:00
Ines Montani
a2525c76ee Reformat word frequencies section in "adding languages" workflow 2016-12-19 17:18:38 +01:00
Ines Montani
ddf5c5bb61 Generalise dependency parsing annotation specs beyond English (closes #657) 2016-12-19 13:42:44 +01:00
Ines Montani
6a793251c8 Add aside on spaCy's custom pronoun lemma 2016-12-19 13:41:47 +01:00
Ines Montani
d0c15730c4 Fix link 2016-12-19 13:09:45 +01:00