| Annotators' home |
K- and N-ras mutations
meaning K-ras mutations and N-ras mutations;
K-ras and N-ras are the entity references
CYP1A1/2
CYP1A1 and CYP1A2
H-, K-, and N-ras
H-ras, K-ras, and N-ras
androgen and estrogen receptors
androgen receptors and estrogen receptors
left- or right-handed sugar
left-handed sugar or right-handed sugar
Use the chain tool only in entity annotation, and only for split references, except in info-gathering mode. See "What to Chain", below.
Previously we had to tag the parts of a split reference (in the examples, "K-" and "ras", and "CYP1A" and "2") separately and note their relationship in the Comment field (archive), which is not accessible to the machine learning algorithms or the data retrieval software and only serves as a guide for other annotators in subsequent stages. But now we can make the connection explicit in the actual annotation by building a chain of strings and applying a single tag to the whole chain.
(NOTE: WF will allow you to specify links in any order, but don't do it! That will mess up some other features that Eric and Jeremy are working on, as well as untagging [#7, below].)
It wouldn't hurt anything to tag "N-ras" or "CYP1A1" as a chain, but it's a couple of extra steps. Save yourself the trouble.
After tagging a chain and while its links are highlighted, look closely and you'll see that only one link's text is in black; the others' is in the entity color. The link with black text is the one selected. Move from link to link, either with the arrow keys or the Chooser buttons, and you'll see this color difference moving along, although all the links will remain highlighted as long as any one of them is selected. The Chooser's "grow" and "shrink" buttons will affect the link that is currently selected.
[2004-11-12] Unrestricted chaining can create insuperable problems for treebanking. Ann and Seth agree that chaining for split coordinations works well, but otherwise don't use it at all unless there is an explicit exception in the guidelines. (At present we have exactly one such exception.) If you see another situation where you think chaining would be appropriate, by all means ask, but don't do it until it's been approved.
This means that we only use it in entity annotation, not POS annotation, let alone paragraph and sentence/section. See archive.
In info-gathering mode, don't chain at all : continue to sweep out the whole text of the entity reference. As of March 2004 this applies only to Malignancy, in the oncology domain.
A string with an embedded abbreviation, like
This applies even to an abbreviation embedded in a split coordination:
(These are on Eric & Jeremy's list already.)
When you change the tag on a chain, the text color changes only on the link that is selected (and you can't see that till you select something else). The other links' colors remain unchanged until you select them, or until the file is closed and reopened.
You can't extend a chain. If you build a chain of two links and tag it, then realize that you meant to include another link, you have to untag the chain (Chooser '–' button, or Backspace key) and then build it again.
2004-11-10: Table of contents
| Annotators' home |
2004-11-12