Tagging publications is painful, and we can help

In Citation tracking, Tools by Xu Cui

“I used to tag our citations by products, applications, research fields, and technologies. But I stopped because it took too much time!”

We have heard this repeatedly from a number of biotech companies recently. Tagging citations is valuable because it lets you show the relevant citations for different purposes, but it is tedious and time-consuming.

To address this pain, we have developed 3 methods for automatic tagging:

1. Phrase matching

Our first method is to match the tagging terms against each citation. For example, let’s say we want to tag the citations by research field, and “Neuroscience” is one of the options. If we find the term “Neuroscience” in a citation’s title, abstract, or snippet, then we tag this citation as “Neuroscience”.

For example, the paper titled “Application of CRISPR-Cas systems in neuroscience” will be tagged as “Neuroscience”.
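
A minimal sketch of phrase matching in Python is shown below. It is only an illustration, not our production code: the function name `phrase_match_tags`, the dictionary layout of a citation, and the choice of a case-insensitive whole-word match are all assumptions made for this example.

```python
import re

def phrase_match_tags(citation, tag_terms):
    """Return the tag terms that literally appear in the citation text.

    `citation` is assumed to be a dict with optional "title",
    "abstract", and "snippet" keys (a hypothetical layout).
    """
    # Combine the fields mentioned above into one searchable string.
    text = " ".join(citation.get(field, "") for field in ("title", "abstract", "snippet"))
    tags = []
    for term in tag_terms:
        # Case-insensitive, whole-word match, so "Neuroscience" matches "neuroscience".
        pattern = r"\b" + re.escape(term) + r"\b"
        if re.search(pattern, text, flags=re.IGNORECASE):
            tags.append(term)
    return tags

citation = {"title": "Application of CRISPR-Cas systems in neuroscience"}
print(phrase_match_tags(citation, ["Neuroscience", "Oncology"]))  # ['Neuroscience']
```

Matching on word boundaries is a design choice here: it keeps a short term from matching inside a longer, unrelated word.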

2. Semantic matching

Often our tag terms do not appear in a citation even though they are semantically related to it. Let’s take a look at the following citation, titled:

“Amygdala-Insula Circuit Computations in Posttraumatic Stress Disorder.”

This citation should be tagged as “Neuroscience”, but the term “Neuroscience” is absent from both the title and the abstract. To solve this issue, we developed a semantic-aware AI. This AI is based on one of the world’s most powerful machine learning models and has been trained on articles containing hundreds of billions of words. It captures the semantic relationship between texts very well. We use this AI to calculate a semantic similarity score between a citation and each tag term. Terms with a high score are used to tag the citation.
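
The scoring step can be sketched as follows. This is a simplified illustration under several assumptions: similarity is measured as cosine similarity between embedding vectors, the `embed` argument stands in for whatever model produces those vectors, and the threshold of 0.5 is arbitrary. The `TOY_VECTORS` table contains made-up numbers so that the example runs without a real model; it is not model output.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def semantic_tags(citation_text, tag_terms, embed, threshold=0.5):
    """Return tag terms whose similarity to the citation meets the threshold.

    `embed` is a hypothetical callable mapping text to a vector;
    `threshold` is an assumed cut-off, to be tuned on real data.
    """
    citation_vec = embed(citation_text)
    scored = [(term, cosine_similarity(citation_vec, embed(term))) for term in tag_terms]
    return [term for term, score in scored if score >= threshold]

# Made-up vectors for illustration only; a real system would call a language model.
TOY_VECTORS = {
    "Amygdala-Insula Circuit Computations in Posttraumatic Stress Disorder": [0.9, 0.1, 0.2],
    "Neuroscience": [1.0, 0.0, 0.1],
    "Oncology": [0.0, 1.0, 0.1],
}

def toy_embed(text):
    return TOY_VECTORS[text]

title = "Amygdala-Insula Circuit Computations in Posttraumatic Stress Disorder"
print(semantic_tags(title, ["Neuroscience", "Oncology"], toy_embed))  # ['Neuroscience']
```

In practice the threshold controls the trade-off between missing relevant tags and adding irrelevant ones, so it is worth checking against a set of manually tagged citations.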

3. Dictionary mapping

We frequently notice that researchers cite a product name in various formats. For example, a product called “ExoQuick-TC” can be cited as “Exo-Quick-TC”, “Exo Quick TC”, “ExoQuick TC”, etc. To tag the citations correctly, we can create a dictionary that maps the various terms to a single, standard term.

This method can also be used to tag research fields, applications, etc. For example, if one of the application tags is “Animal”, then we can create a dictionary mapping mouse, mice, rat, rats, pig, pigs, etc. to “Animal”.
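
A small sketch of dictionary mapping, using the examples above, might look like this. The names `build_lookup` and `dictionary_tags` and the data layout are assumptions for illustration, and the variant lists contain only the terms mentioned in this article.

```python
import re

# Standard tag -> variants seen in citations (taken from the examples above).
VARIANTS_BY_TAG = {
    "ExoQuick-TC": ["ExoQuick-TC", "Exo-Quick-TC", "Exo Quick TC", "ExoQuick TC"],
    "Animal": ["mouse", "mice", "rat", "rats", "pig", "pigs"],
}

def build_lookup(variants_by_tag):
    """Invert the mapping: lower-cased variant -> standard tag."""
    lookup = {}
    for tag, variants in variants_by_tag.items():
        for variant in variants:
            lookup[variant.lower()] = tag
    return lookup

def dictionary_tags(text, lookup):
    """Return the standard tags whose variants appear in the text."""
    tags = []
    for variant, tag in lookup.items():
        # Whole-word match, so "rat" does not match inside "concentration".
        pattern = r"\b" + re.escape(variant) + r"\b"
        if re.search(pattern, text, flags=re.IGNORECASE) and tag not in tags:
            tags.append(tag)
    return tags

lookup = build_lookup(VARIANTS_BY_TAG)
print(dictionary_tags("Exosomes were isolated with Exo Quick TC from rat serum", lookup))
# ['ExoQuick-TC', 'Animal']
```

Keeping the dictionary as plain data means new spellings can be added without touching the matching code.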

In summary, the 3 methods listed above can be used to tag citations with high accuracy. If you are spending too much time tagging your citations manually, please let us know.

