Table of Contents | Introduction | Resources | Search Strategies | Overall Search Strategy | Example Searches | Checklist and Hints & Tips | Glossary and Connection Details

PREVIOUS | NEXT


3. Search Strategies

The various different search strategies described in this section are complementary - they are designed to be used together. If one doesn't work - try one of the others, or try it on a different resource. We use examples that are based on real search needs that we have encountered in our work on this project.

3.1 Keyword Searching

This is the most common search strategy and the most important as it can be applied to almost any online resource. It simply means that you specify the subject you are interested in by searching on a series of words. The words are those you would expect to find in an article title (or description) of your area of interest.

For example, say you are interested in the role of women in Buddhism. One approach is to divide this up into concepts and list the words that might be expected to be associated with these concepts.

Concept Women Buddhism

Keywords
women
woman
female
feminist
Buddhism
Buddhist
Buddha
India
Tibet
Thailand
religion

The point is that your area of interest may be defined by several words. If you restrict yourself to just women and Buddhism then you may miss an article/book called 'Experiences of Buddhist women' or 'A feminist critique of religion in Tibetan society'. Authors of articles and books can express the same ideas in several different ways.

For a real example, in the BIDS ISI Arts & Humanities database for 1993-6:

If you only used one of these keywords you would miss a large number of relevant articles. There is an easy way to overcome this problem which exploits the similarity of words describing similar concepts and it is known as truncation searching (or wildcards).

3.1.1 Truncation and Wildcards

The words Buddhism, Buddhist and Buddha all describe closely related concepts and all share a common stem - Buddh. What we really want the search to do is to match on any word that starts with these letters. To do this we can use a wildcard character which can stand for any other letter or letters. The wildcard character can vary between systems but it is usually the asterisk, *. This allows us to truncate the search term like this:

buddh*

Returning to our BIDS search:

Not only is this truncated search quicker but it is more reliable - if you select the right stem then you can search for things which are relevant but you hadn't even considered! For example, if you use the stem wom* to catch both women and woman you may also get:

Of course this can be bad thing if you accidentally incorporate other words (e.g. wombat) into your search hits. A dictionary can be very useful in choosing appropriate word stems for a search.

3.1.2 Combining Searches: AND, OR and NOT

Although using truncation is an important stage in widening this search we also need to include other terms. Most systems allow you to specify several searches and then combine them using three operators: AND, OR and NOT

The AND operator allows you to find those items which are common to two different searches, for example:

will give you a list of articles that contain both women and buddhism

On BIDS this search returned several articles, including:

Technically, the AND operator returns the intersection of the two search sets. One approach is to try to see the result as a Venn Diagram:

The AND operator returns only those articles in region A of the Venn Diagram. Each search is called a set and usually associated with a number. Different systems use different characters to represent the AND operator, BIDS uses +. Although you can also use &.

For example on BIDS, the women AND buddhism search is like this:

Set 3 is produced by giving the set number 1, the operator + (AND) and the set number 2.

The OR operator is used to join together two search sets to produce a bigger search set. In our example, we want to join together the wom* search with another one including female and feminist (fem* should be fine) to produce the women concept set.

On BIDS the OR operator is a comma ,

Then we can use an AND to combine this with the Buddhism concept (we will use Buddh* for simplicity for now). The result on BIDS is:

Set 3 is the women concept creating by using an OR on sets 1 and 2. Set 4 is the Buddhism concept using a simple truncation search. Set 5 is where it all comes together.

Note that we got 13 hits, when we tried the simple women AND Buddhism we got just 3. In other words with the simple search we missed 77% of the things we found using the more complex search!

Sometimes a word, or truncated word stem, brings back lots of words we don't want. It is possible that using wom* could have brought back numerous articles on wombats. The final operator, NOT, allows us to get rid of those whilst keeping the ones we want.

To use NOT in this way we could say:

On BIDS the symbol for NOT is the minus sign, -

Trying this search in the BIDS BL Inside Information system we get:

We successfully removed the 17 occurrences of wombat*. In this case it wasn't terribly important but the principle can be useful and save a lot of time in some searches.

3.1.3 Choosing Keywords

A few examples should illustrate the importance of choosing the right keywords. If you were searching for articles about the leader of Libya who would you search for:

For the political history of Russia:

Similarly:

American spellings / phrases are a common problem:

On many systems there are words, known as stop words, which are automatically removed from your search expression. These are typically common words such as:

This can make it difficult to search for certain expressions that contain these words, e.g. "to be or not to be". However, there is usually a mechanism for forcing the system to look for these words - but you will probably have to read the system specific help information.

3.2 Person-based Searching

The most common form of person-based searching is an author search - where the keyword is the author's name. The main problems encountered are:

authors and database compilers are sometimes inconsistent in the way they refer to themselves. This most commonly occurs by omitting, or including, a middle initial. For example, a BIDS ISI Science Citation Index search for the author John Self returns completely different results depending on whether his middle initial is used.

A search for SELF_JA produces:

A search for SELF_J produces:

The two searches don't even have any common items! But it is the same person!

people change their names. A search for Micheline Beaulieu between 1993-6 in the BIDS ISI Social Science database produces 7 hits including:

However, a search for Micheline Hancock-Beaulieu in the same database in the same year range produces 7 entirely different hits, including:

Note that on BIDS you should omit hyphens for double barrelled names: Micheline Hancock-Beaulieu is searched for as HANCOCKBEAULIEU_M

3.3 Institution-based searching

This strategy relies on the grouping together of people with similar interests into research (or other institutional) groups. The chances of another person sharing some of the same interests (and therefore writing about the same topics) as another person are much higher if they belong to a common group.

The most effective method to achieve institution-based searching is via the Web. The steps are:

3.4 Publication-based Searching

This is a very straightforward strategy. If you find a journal which contains one or more articles or interest then it probably worthwhile having a quick look at other articles in the same journal or publication. This can be done very quickly by hand in the Library or via BIDS.

Alternatively you can use journals to narrow down a search in combination with another search term. Using stress as a keyword will have radically different results if you combine it with the Journal of Occupational Psychology compared with the Journal of Mechanical Engineering Science.

3.5 Citation Searching

Citation searching is a powerful search strategy that is especially useful if either:

A citation is another term for a reference - when an author refers to a previous piece of work in the text of the article. They are usually gathered together at the end of the article in a long alphabetical list - although sometimes they are included as footnotes. Some database indices, such as BIDS ISI, include these citations - establishing links between articles. These links are not dependent on the articles having the same keywords - they are an indication that one author thought the cited article was in some way relevant.

What this means is that you can specify an article and then see a list of all the authors in the database who subsequently referred to that article. For example, asking for a citation search on BIDS ISI Science Citation Index for TWIDALE_M produces this list:

The first thing to note is that the same warnings about inconsistent naming still apply, doing the same for TWIDALE_MB only produces 1 entry:

Each entry refers to a different citation - although entries 6 and 7 are actually the same article - and the number on the right is the number of times that article has been cited. In general you would expect older articles to have been cited more frequently. Selecting one of these, the 1995 M.B. Twidale entry, leads us to the article:

This is the only article, so far, to have cited the 1995 article in the Journal of the Association for Learning Technology. Note that this journal is not even indexed by BIDS ISI.

Citation searching allows us to look forward in time from an article and see who is referring to that work. It is a useful way of finding out about new authors in a field.

3.6 Similarity Searching: making use of other people's searches

People often repeat searches that others have done in the past. Usually there is no way to re-use these searches and so save time and effort. However there are cases when these searches are published so that we can successfully re-use them:

Whenever items have been organised into categories similar things are found next to each other. This is most obvious in a Library or Bookshop, where books on the same topic or by the same author are on the same shelf. In Lancaster Library the Classmark associated with each item allows you to look for things with the same, or a similar, Classmark. Hierarchical Indices on the Web, or a CD-ROM, also enable you to look for similar things. Some Web search engines also have a 'Show me similar Web pages' option.

Some types of articles are reviews of a particular field - summaries of the main trends and issues. These are particularly useful for quickly understanding an area or finding the classic references. On BIDS ISI you can restrict your searching to just reviews through the 'Limit searching by document type' restriction on the Options menu.

This gives a large menu of different types of document. Selecting the Review option will then mean that your search will only find review articles. Combining this restriction with a normal keyword search should enable you to quickly find review articles about a particular topic (if they exist).

Some collections of references are published as books and pamphlets, these are also a useful means of taking advantage of other peoples' searches, for example:

There are several Web indices which have been set up to aid searchers by collecting, evaluating Web sites and other online sources. In the UK the best sources for these are collected at NISS (National Information Services and Systems) Web site in the Directory of networked resources.

Examples include:

These indices are the results of several people collecting and organising information. However, the Web also allows people to publish their own lists. It is quite common for a Web surfer to have a link on their Home page to their own list of bookmarks. These public bookmarks allow you to exploit their efforts at finding things. You can locate people's Home pages either by navigating to them or through a search engine.


PREVIOUS | NEXT