Developing the epistemic model of documents with marked up text from articles based on the 'Fragmenter'
Based on Anita de Waard’s work, we use a simple representation of ‘Epistemic Segment Types’. These consist of the following codes:
fact
: a claim that has been accepted to be true, a known fact.hypothesis
: a proposed idea, not supported by evidenceproblem
: unresolved, contradictory, or unclear issuegoal
: research goalmethod
: experimental methodresult
: a restatement of the outcome of an experimentimplication
: an interpretation of the results, in light of earlier hypotheses and factsother-goal
: Goal described in a cited paperother-implication
: Interpretation made in a cited paperother-problem
: Problem described in a cited paperother-fact
: Fact with an explicit referenceother-hypothesis
: Hypothesis presented in a cited paperother-method
: Method description in a cited paperOther-Result, other-result
: Experimental Result in a cited paper
The BioScholar
system is the web application I am building to house the framework for knowledge management and engineering based on the KEfEd / CoSI model. The Digitial Libary
is concerned with all aspects of managing papers and the Fragmenter
is simply the subcomponent within that that handles delineating fragments of text from within the overall body of the document. We have advanced this system significantly.
Figure 1 shows the latest version of the annotation system. This image shows the basic formulation of the Digital Library as a way of managing the full text documents of papers (PDFs, XMLs, HTMLs). It provides fragmenting capability over PDF files where each numbered fragment may be made up of several blocks, each fragment may also be assigned a fragment type
(based on the epistemic categories shown above).
Figure 1: Screenshot of the fragmenter superimposed over Sasaki et al. 2011
Figure 2 shows the list of fragments.
Figure 2: Screenshot of the list of fragments themselves (there is a bug where the fragments are ordered by numbers in their string order, not numeric).
Note that Figure 1 has a button labeled brat
. This permits the user to automatically dump the current fragments to an installation of the brat
annotation system for subsequent markup and processing (see Figure 3).
Figure 3: A list of 14 fragments pertaining to experimental work from Sasaki et al. 2011 where each fragments delineates a separate experiment.
The Bioscholar system now has implemented an annotation framework that permits annotators to process texts according to these epistemics codes is shown at the following link: