| title | teaser | tag | source | new | 
|---|---|---|---|---|
| GoldCorpus | An annotated corpus, using the JSON file format | class | spacy/gold.pyx | 2 | 
This class manages annotations for tagging, dependency parsing and NER.
GoldCorpus.__init__
Create a GoldCorpus. If the input data is an iterable, each item should be a
(text, paragraphs) tuple, where each paragraph is a tuple
(sentences, brackets), and each sentence is a tuple
(ids, words, tags, heads, ner). See the implementation of
gold.read_json_file for further details.
| Name | Type | Description |
|---|---|---|
| train | str / Path / iterable | Training data, as a path (file or directory) or iterable. |
| dev | str / Path / iterable | Development data, as a path (file or directory) or iterable. |
| RETURNS | GoldCorpus | The newly constructed object. |
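
The nested tuple layout for iterable input can be hard to picture from prose alone. The following is a minimal sketch in plain Python that builds one item by reading the description above literally. The sample sentence, the tag values, the head indexing, the BILUO entity labels and the `check_item` helper are all assumptions made for illustration and are not part of spaCy; the exact layout expected by a given spaCy version should be confirmed against gold.read_json_file.

```python
# Hypothetical sample data. The tag values, head indexing and entity
# labels below are assumptions chosen for illustration only.
ids = [0, 1, 2]
words = ["Apple", "hires", "engineers"]
tags = ["NNP", "VBZ", "NNS"]
heads = [1, 1, 1]          # assumed: absolute index of each token's head
ner = ["U-ORG", "O", "O"]  # assumed: BILUO entity scheme

sentence = (ids, words, tags, heads, ner)
brackets = []                       # no constituency brackets in this sketch
paragraph = ([sentence], brackets)  # (sentences, brackets)
item = ("Apple hires engineers", [paragraph])  # (text, paragraphs)
train_data = [item]


def check_item(item):
    """Hypothetical helper: return True if an item has the shape described above."""
    try:
        _text, paragraphs = item
        for sentences, _brackets in paragraphs:
            for sent in sentences:
                ids, words, tags, heads, ner = sent
                n = len(ids)
                # Every per-token column must have one entry per token.
                if not all(len(col) == n for col in (words, tags, heads, ner)):
                    return False
    except (TypeError, ValueError):
        # Raised when a tuple cannot be unpacked into the expected fields.
        return False
    return True
```

Going by the parameter table, a list such as `train_data` would then be supplied as the train or dev argument in place of a file or directory path.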