Important Concepts underlying searching in the DQC


Browse Titles Help

The Browse Titles screen allows a user to obtain a list of work/documents in one of two ways.

  • Select a letter - click on a letter to view all works whose title begins with that letter.
  • Search - enter query terms into the input box and click on Find Titles to find titles containing ANY of the query terms entered.
  • When you click on or select a title in the result list obtained from a browse titles action, you will be presented with the Table of Contents of the work. From there you will be able to navigate to the pages and sections of the work.

    The Organization of the Digital Quaker Collection

    The DQC is comprised of books, journals, and Corpus volumes. A Corpus being an edited compilation of individual works. In total, the DQC encompasses over 500 "works".

    Each book, journal, or work which is contained in a Corpus has been "loaded" into the software upon which the system is built in a manner which allows a user to identify any given work by entering a search query or by browsing the DQC by Title or by Author.

    The browsing functions of the DQC provide a mechanism for listing the Titles and Authors of works based on user selection criteria such as a word in a title or the name of an author. In browse mode, user selection of a title in the result list causes the TOC (Table of Contents) for the work to be displayed. From there the user can navigate to pages in the work and/or page images of pages in the works - or she/he can return to the result list of works or to other system functions.

    Each "work" (also called a "document" within these pages and the interface) is XML encoded data. Each work is composed of many nodes - and in turn each node may contain many other nodes - paragraghs within chapters within volumes; lines within poems within letters; etc.

    The searching functions of the DQC produce a search result set which is a list of the "works" which have been "loaded" into the underlying software and which satisfy the users query. The query is executed against a whole work NOT against specific nodes in it. I.E., the result set is a list of "works" which satisfied the query, not a list of nodes within a work.

    When a user selects a "work" from the list of titles in the result set, the system then displays the nodes within the "work" which contained one or more of the words which comprised the query. Please note that it is quite likely that all of the query terms that you entered will not appear in each node. So if you entered the query: heaven AND hell, the complete work in the result list contains both terms, but any given hit node may only contain one of the query terms. An exception to this rule occurs when you've entered a "phrase" as part of your query - in this case indeed, the hit nodes will contain the phrase.

    From a hit node, the user may elect to display the page on which the hit node occured or may elect to "expand" the hit node to the next larger node containing it. For example, if the hit node represents a paragraph, the expand function may then display the chapter which contained the paragraph - depending on the underlying structure of the XML tagging.

    What is a document?

    A document (or record) in the Digital Quaker Collection (DQC) is a complete work - whether that be a book, a journal or in the case of a Corpus one of the works which comprises the Corpus. The documents are XML encoded entities, and as such are themselves comprised of smaller pieces which are called nodes. Nodes are things like paragraphs, chapters, lines in a poem, etc.

    In terms of the DQC software, a document is that entity which can be retrieved as the result of a search (or by browsing titles or authors). It should be noted that when you elect to view one of the documents in a search result set, the first screens that appear are the nodes of the document that contained the query terms you entered. The query terms should be highlighted. From there you will be able to View the page in the document which contains the node, or to expand the node to the node which encompasses it. (I.E., you can expand a paragraph to the chapter which contains it).

    Please note that it is quite likely that all of the query terms that you entered will not appear in each node. So if you entered the query: heaven AND hell, the documents in the result list contain both terms, but any given hit node for a specific document may only contain one of the query terms.