INSD (DDBJ/EMBL-Bank/GenBank) Help

How to Query

  1. Searching with multiple keywords
  2. Including or excluding words from your search
  3. Wildcard searches
  4. Fuzzy searches
  5. Proximity Searches
  6. Boosting a keyword
  7. Grouping
  8. Searching with keywords that contain special characters

1. Searching with multiple keywords

To search with multiple keywords, separate each word with a space such as "cell protein". This will search for INSD entry with either of the keywords. If you want to search for INSD entry with all of the keywords, use the AND operator, such as "cell AND protein".

2. Including or excluding words from your search

You can use operators to include or exclude specific words from your search.

A union using sets - OR operator or || symbol

This will search for INSD entry with either of the keywords. For example, "superfamily OR transcription" or "superfamily || transcription" will search for INSD entry related to either superfamily or transcription.

An intersection using sets – AND operator or && symbol

This will search for INSD entry which contain both of the keywords. For example, if you type "superfamily AND transcription" or "superfamily && transcription", a search for INSD entry which contain both keywords "superfamily" and "transcription" will be performed.

A difference using sets – NOT operator or ! symbol

The NOT operator excludes INSD entry that contain the keyword after the NOT operator. For example, if you type "superfamily NOT transcription" or "superfamily !transcription", a search for INSD entry which contain superfamily but does not contain transcription will be performed.

Plus (+) symbol

The "+" symbol performs a search for INSD entry that must contain the keyword which comes after the "+" symbol. For example, use "superfamily +transcription" to search for INSD entry that may contain "superfamily" but must contain "transcription".

Minus (-) symbol

The "-" symbol excludes INSD entry that contain the keyword which comes after the "-" symbol. For example, use "-superfamily –transcription" to search for INSD entry without "superfamily" and without "transcription".

3. Wildcard searches

You can use the following characters for wildcard searches.

Question mark (?) for single character wildcard search

The single character wildcard search will look for images with keywords which match that with the single character replaced. For example, use "sho?t" to search for "short" or "shoot".

Asterisk (*) for multiple characters wildcard search

The multiple character wildcard search will look for images with keywords that match that with 0 or more characters replaced. For example, use "Oshox*" to search for "Oshox1" or "Oshox".

4. Fuzzy searches

You can use the tilde (~) symbol at the end of a single keyword to perform a fuzzy search which will look for other words with spelling similar to the keyword. For example, use "kasalath~" to search for INSD entry related to "kasalath" or "karalath".
Furthermore, you can specify the edit distance (positive integer) after the tilde, e.g., "indica~2". Edit distance refers to the minimum number of operations (insertion, deletion or replacement) required to convert the keyword into the targeted word. For example, when you search with “indica~2” as the keyword, the results will include words with the edit distance of 2 such as indica, induc.

5. Proximity Searches

You can add the tilde (~) symbol and a number at the end of a phrase to specify the distance of the keywords with one another. For example, use "protein kinase"~10 to search for information with "protein" and "kinase" within 10 words of each other.

6. Boosting a keyword

You can boost a keyword by adding the caret (^) symbol with a boost factor (a number) at the end of the keyword. The boost factor must be a positive number and the higher the number, the more relevant the keyword is. By default, the boost factor of a keyword is 1. For example, use "kinase^2 protein^0.1" to indicate that leaf is twice as relevant and the embryo is one-tenth as relevant.

7. Grouping

Keywords grouped together with parentheses () will be prioritized in the search. For example, use "(disease OR embryo) AND stress" to search for stress that are disease or embryo.

8. Searching with keywords that contain special characters

You can escape the supported special characters (+ - && || ! ( ) { } [] ^ " ~ * ? : \) with a backslash (\) to search for keywords which contains special characters. For example, use "\(IRRI\)" to search for (IRRI).

/rice/oryzabase