Matthew Honnibal
|
2154a54f6b
|
* Add spacy.de
|
2015-09-06 21:56:47 +02:00 |
|
Matthew Honnibal
|
f6ec5bf1b0
|
* Use empty tag map in vocab if none supplied
|
2015-09-06 20:19:27 +02:00 |
|
Matthew Honnibal
|
4f8e38271d
|
* Fix merge errors in lexeme.pxd
|
2015-09-06 20:19:08 +02:00 |
|
Matthew Honnibal
|
86c888667f
|
* Merge in changes from de branch
|
2015-09-06 19:49:28 +02:00 |
|
Matthew Honnibal
|
d2fc104a26
|
* Begin merge of Gazetteer and DE branches
|
2015-09-06 19:45:15 +02:00 |
|
Matthew Honnibal
|
dbf8dce109
|
Merge branch 'gaz' of ssh://github.com/honnibal/spaCy into gaz
|
2015-09-06 18:44:14 +02:00 |
|
Matthew Honnibal
|
9eae9837c4
|
* Fix morphology look up
|
2015-09-06 17:53:39 +02:00 |
|
Matthew Honnibal
|
6427a3fcac
|
* Temporarily import flag attributes in matcher
|
2015-09-06 17:53:12 +02:00 |
|
Matthew Honnibal
|
7cc56ada6e
|
* Temporarily add py_set_flag attribute in Lexeme
|
2015-09-06 17:52:51 +02:00 |
|
Matthew Honnibal
|
e35bb36be7
|
* Ensure Lexeme.check_flag returns a boolean value
|
2015-09-06 17:52:32 +02:00 |
|
Matthew Honnibal
|
7e4fea67d3
|
* Fix bug in token subtree, introduced by duplication of L/R code in Stateclass. Need to consolidate the two methods.
|
2015-09-06 10:48:36 +02:00 |
|
Matthew Honnibal
|
5edac11225
|
* Wrap self.parse in nogil, and break if an invalid move is predicted. The invalid break is a work-around that papers over likely bugs, but we can't easily break in the nogil block, and otherwise we'll get an infinite loop. Need to set this as an error flag.
|
2015-09-06 04:15:00 +02:00 |
|
Matthew Honnibal
|
fd1eeb3102
|
* Add POS attribute support in get_attr
|
2015-09-06 04:13:03 +02:00 |
|
Matthew Honnibal
|
534e3dda3c
|
* More work on language independent parsing
|
2015-08-28 03:44:54 +02:00 |
|
Matthew Honnibal
|
c2307fa9ee
|
* More work on language-generic parsing
|
2015-08-28 02:02:33 +02:00 |
|
Matthew Honnibal
|
86c4a8e3e2
|
* Work on new morphology organization
|
2015-08-27 23:11:51 +02:00 |
|
Matthew Honnibal
|
5b89e2454c
|
* Improve error-reporting in tagger
|
2015-08-27 10:26:36 +02:00 |
|
Matthew Honnibal
|
f0a7c99554
|
* Relax rule-requirement in lemmatizer
|
2015-08-27 10:26:19 +02:00 |
|
Matthew Honnibal
|
0af139e183
|
* Tagger training now working. Still need to test load/save of model. Morphology still broken.
|
2015-08-27 09:16:11 +02:00 |
|
Matthew Honnibal
|
1302d35dff
|
* Rework interfaces in vocab
|
2015-08-26 19:21:46 +02:00 |
|
Matthew Honnibal
|
2d521768a3
|
* Store Morphology class in Vocab
|
2015-08-26 19:21:03 +02:00 |
|
Matthew Honnibal
|
d30029979e
|
* Avoid import of morphology in spans
|
2015-08-26 19:20:46 +02:00 |
|
Matthew Honnibal
|
119c0f8c3f
|
* Hack out morphology stuff from tokenizer while morphology is being reimplemented.
|
2015-08-26 19:20:11 +02:00 |
|
Matthew Honnibal
|
b4faf551f5
|
* Refactor language-independent tagger class
|
2015-08-26 19:19:21 +02:00 |
|
Matthew Honnibal
|
a3d5e6c0dd
|
* Reform constructor and save/load workflow in parser model
|
2015-08-26 19:19:01 +02:00 |
|
Matthew Honnibal
|
1d7f2d3abc
|
* Hack on morphology structs
|
2015-08-26 19:18:36 +02:00 |
|
Matthew Honnibal
|
f8f2f4e545
|
* Temporarily add PUNC name to parts_of_speech dictionary, until better solution
|
2015-08-26 19:18:19 +02:00 |
|
Matthew Honnibal
|
008b02b035
|
* Comment out enums in Morphology for now
|
2015-08-26 19:17:35 +02:00 |
|
Matthew Honnibal
|
378729f81a
|
* Hack Morphology class towards usability
|
2015-08-26 19:17:21 +02:00 |
|
Matthew Honnibal
|
430affc347
|
* Fix missing n_patterns property in Matcher class. Fix from_dir method
|
2015-08-26 19:17:02 +02:00 |
|
Matthew Honnibal
|
3acf60df06
|
* Add missing properties in Lexeme class
|
2015-08-26 19:16:28 +02:00 |
|
Matthew Honnibal
|
76996f4145
|
* Hack on generic Language class. Still needs work for morphology, defaults, etc
|
2015-08-26 19:16:09 +02:00 |
|
Matthew Honnibal
|
e2ef78b29c
|
* Gut pos.pyx module, since functionality moved to spacy/tagger.pyx
|
2015-08-26 19:15:42 +02:00 |
|
Matthew Honnibal
|
c4d8754385
|
* Specify LOCAL_DATA_DIR global in spacy.en.__init__.py
|
2015-08-26 19:15:07 +02:00 |
|
Matthew Honnibal
|
c2d8edd0bd
|
* Add PROB attribute in attrs.pxd
|
2015-08-26 19:14:19 +02:00 |
|
Matthew Honnibal
|
c5a27d1821
|
* Move lemmatizer to spacy
|
2015-08-25 15:47:08 +02:00 |
|
Matthew Honnibal
|
82217c6ec6
|
* Generalize lemmatizer
|
2015-08-25 15:46:19 +02:00 |
|
Matthew Honnibal
|
8083a07c3e
|
* Use language base class
|
2015-08-25 15:37:30 +02:00 |
|
Matthew Honnibal
|
f2f699ac18
|
* Add language base class
|
2015-08-25 15:37:17 +02:00 |
|
Matthew Honnibal
|
5dd76be446
|
* Split EnPosTagger up into base class and subclass
|
2015-08-24 05:25:55 +02:00 |
|
Matthew Honnibal
|
5d5922dbfa
|
* Begin laying out morphological features
|
2015-08-24 01:04:30 +02:00 |
|
Matthew Honnibal
|
6f1743692a
|
* Work on language-independent refactoring
|
2015-08-23 20:49:18 +02:00 |
|
Matthew Honnibal
|
3879d28457
|
* Fix https for url detection
|
2015-08-23 02:40:35 +02:00 |
|
Matthew Honnibal
|
cad0cca4e3
|
* Tmp
|
2015-08-22 22:04:34 +02:00 |
|
Matthew Honnibal
|
bf38b3b883
|
* Hack on l/r reversal bug
|
2015-08-10 05:58:43 +02:00 |
|
Matthew Honnibal
|
6116413b47
|
* Fix label prediction in StepwiseState
|
2015-08-10 05:05:31 +02:00 |
|
Matthew Honnibal
|
2c9753eff2
|
* Whitespace
|
2015-08-10 00:09:02 +02:00 |
|
Matthew Honnibal
|
9de98f5a6f
|
* Add Parser.stepthrough method, with context manager
|
2015-08-10 00:08:46 +02:00 |
|
Matthew Honnibal
|
fe43f8cf39
|
* Whitespace
|
2015-08-09 02:31:53 +02:00 |
|
Matthew Honnibal
|
9c090945e0
|
* Add Parser.predict method, and clean up Parser.get_state
|
2015-08-09 02:29:58 +02:00 |
|