From: A Chinese telemedicine-dialogue dataset annotated for named entities
Statistic | This dataset | IMCS-NER |
---|---|---|
Count of all named entities | 63,560 | 74,698 |
Average length of entity | 4.33 | 2.63 |
Count of total characters | 1,700,392 | 1,621,161 |
Ratio of tagged characters to total characters | 16.2% | 12.1% |
Average count of characters per consultation | 713.55 | 589.04 |
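
The tagged-character ratio in the table can be approximately reproduced from the other rows, which is a quick way to check that the figures are internally consistent. The sketch below assumes that entity length is measured in characters and that entities do not overlap, so that the number of tagged characters is roughly the entity count times the average entity length. The function name `tagged_ratio` and the `stats` dictionary are illustrative and do not come from the source.

```python
def tagged_ratio(entity_count: int, avg_entity_len: float, total_chars: int) -> float:
    """Estimate the share of characters that fall inside a named entity.

    Assumes non-overlapping entities whose length is counted in characters.
    """
    return entity_count * avg_entity_len / total_chars


# Values copied from the table: (entity count, average entity length, total characters).
stats = {
    "This dataset": (63_560, 4.33, 1_700_392),
    "IMCS-NER": (74_698, 2.63, 1_621_161),
}

estimates = {name: tagged_ratio(*values) for name, values in stats.items()}

for name, ratio in estimates.items():
    print(f"{name}: {ratio:.1%}")
```

Under these assumptions the estimates round to the reported 16.2% and 12.1%. Because the average lengths are themselves rounded to two decimals, small deviations in further decimal places are expected.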