mirror of
				https://github.com/explosion/spaCy.git
				synced 2025-10-31 07:57:35 +03:00 
			
		
		
		
	
		
			
				
	
	
	
		
			1.1 KiB
		
	
	
	
	
	
	
	
			
		
		
	
	
			1.1 KiB
		
	
	
	
	
	
	
	
| title | teaser | tag | source | new | 
|---|---|---|---|---|
| GoldCorpus | An annotated corpus, using the JSON file format | class | spacy/gold.pyx | 2 | 
This class manages annotations for tagging, dependency parsing and NER.
GoldCorpus.__init__
Create a GoldCorpus. IF the input data is an iterable, each item should be a
(text, paragraphs) tuple, where each paragraph is a tuple
(sentences, brackets), and each sentence is a tuple
(ids, words, tags, heads, ner). See the implementation of
gold.read_json_file
for further details.
| Name | Type | Description | 
|---|---|---|
| train | str / Path/ iterable | Training data, as a path (file or directory) or iterable. | 
| dev | str / Path/ iterable | Development data, as a path (file or directory) or iterable. | 
| RETURNS | GoldCorpus | The newly constructed object. |