Classify and extract text 10x better and faster 🦾



BioCreative II Gene Mention Recognition (BC2GM) Dataset

Created by Smith et al. in 2008, the BioCreative II Gene Mention Recognition (BC2GM) dataset is a benchmark in which participants identify gene mentions in a sentence by giving the start and end characters of each mention. The training set consists of sentences, each paired with its set of gene mentions (GENE annotations). The dataset is in English, and access requires registration. The listing gives its size as 20 (unit unspecified) and does not specify a file format.
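To make the start/end annotation idea concrete, here is a minimal Python sketch. It is not the official BC2GM file format or evaluation procedure: the `GeneMention` class, the helper functions, the example sentence, and the offset convention (0-based, inclusive end, counting every character including spaces) are all assumptions made for illustration. The dictionary lookup stands in for a real gene mention tagger.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class GeneMention:
    """Hypothetical container for one GENE annotation."""
    start: int  # index of the first character of the mention (assumed 0-based)
    end: int    # index of the last character of the mention (assumed inclusive)


def mention_text(sentence: str, mention: GeneMention) -> str:
    """Return the substring of `sentence` covered by `mention`."""
    return sentence[mention.start : mention.end + 1]


def find_mentions(sentence: str, gene_names: list[str]) -> list[GeneMention]:
    """Locate every exact occurrence of each known gene name.

    A plain string match, used here only as a placeholder for a trained model.
    """
    found = []
    for name in gene_names:
        pos = sentence.find(name)
        while pos != -1:
            found.append(GeneMention(pos, pos + len(name) - 1))
            pos = sentence.find(name, pos + 1)
    return sorted(found, key=lambda m: m.start)


# Illustrative sentence, not taken from the dataset.
sentence = "Mutations in BRCA1 and TP53 are common."
mentions = find_mentions(sentence, ["BRCA1", "TP53"])

for m in mentions:
    print(m.start, m.end, mention_text(sentence, m))
```

Before scoring against the real annotations, check the dataset's own documentation for how offsets are counted, since the convention assumed here may differ.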

Dataset Sources

Here you can download the BioCreative II Gene Mention Recognition (BC2GM) dataset. The file format is not specified.

Download BioCreative II Gene Mention Recognition (BC2GM) dataset files

Fine-tune with BioCreative II Gene Mention Recognition (BC2GM) dataset

Metatext is a no-code tool for training, tuning, and integrating custom NLP models.


Paper

Read the full original BioCreative II Gene Mention Recognition (BC2GM) paper.

Download PDF paper



Metatext helps you classify and extract information from text and documents using language models customized with your data and expertise.