Metagenomics - E-value & Bit-score (2024)

The E-value (expectation value) is a bit-score corrected for the size of the sequence database. It therefore depends on the size of the database searched: large databases increase the chance of false-positive hits, and the E-value corrects for that higher chance.
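
As a rough illustration of this dependence, the sketch below assumes the commonly used approximation E = m × n × 2^(−S'), where m is the query length, n is the total database length, and S' is the bit-score. The function name is hypothetical, and BLAST itself works with effective (edge-corrected) lengths, so the numbers are indicative only.

```python
def evalue_from_bitscore(bit_score: float, query_len: int, db_len: int) -> float:
    """Approximate E-value: expected number of chance hits scoring >= bit_score.

    Assumes E = m * n * 2**(-S') with raw (not effective) lengths.
    """
    return query_len * db_len * 2.0 ** (-bit_score)


# Same hit (50 bits, 300-residue query) against two database sizes.
small_db = evalue_from_bitscore(50.0, 300, 1_000_000)
large_db = evalue_from_bitscore(50.0, 300, 100_000_000)

print(f"small database: E = {small_db:.3g}")
print(f"large database: E = {large_db:.3g}")
```

The bit-score is identical in both calls; only the E-value changes, and it grows by the same factor as the database.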

What is the relationship between bit score and E value? ›

Bit score is a normalized score and hence is independent of the size of the database, while E-values are very sensitive to the database size. Generally, bit scores of 40 or higher are considered reliable.

What does an E value of 0.01 mean? ›

If E is between 0.01 and 1e-50, the match can be considered a result of homology. If E is between 0.01 and 10, the match is considered not significant, but may hint at a tentative remote homology relationship. Additional evidence is needed to confirm the tentative relationship.
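
The ranges above can be written as a small lookup. This is only a sketch: the function name is hypothetical, the labels for values below 1e-50 and above 10 are assumptions (the text does not cover those ranges), and the boundary value 0.01 is arbitrarily assigned to the weaker category.

```python
def classify_evalue(evalue: float) -> str:
    """Map an E-value to the rule-of-thumb categories described above."""
    if evalue < 0:
        raise ValueError("E-values cannot be negative")
    if evalue < 1e-50:
        return "very strong match (assumed label)"
    if evalue < 0.01:
        return "homology"
    if evalue <= 10:
        return "not significant, possible remote homology"
    return "likely chance (assumed label)"


for e in (1e-60, 1e-20, 0.5, 50.0):
    print(e, "->", classify_evalue(e))
```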

What is the difference between E value and BLAST score? ›

In addition to the bitscore, an e-value is reported for each BLAST hit. This value indicates whether this hit may be due to chance, rather than a real similarity between query and hit sequence. The e-value is based on the bitscore, but is transformed according to the sizes of the query and the database.

What does a high E value mean? ›

Lower (i.e., stronger) E-values indicate more significant alignments, suggesting a higher probability that the sequences share a common evolutionary origin. A higher (i.e., weaker) E-value indicates that the alignment might be a random event.

What is the score and E value? ›

The Expect value (E) is a parameter that describes the number of hits one can “expect” to see by chance when searching a database of a particular size. It decreases exponentially as the Score (S) of the match increases. Essentially, the E value describes the random background noise.

What is the significance of the bit score? ›

Bit score is an important measure that gives an indication about the statistical significance of an alignment. In simple terms, the higher the bit score, the more similar the two sequences are. Bit scores below 50 are generally assumed to be untrustworthy.

What E-value is statistically significant? ›

In principle, an E-value lower than 0.05 can be considered a statistically significant hit. In practice, however, even more stringent E-value cut-offs are used. A hit may have a very low E-value and still be a false positive.

Can E value be greater than 1? ›

The e-value is basically a measure of how many such alignments you would expect to find in a database this size by chance. Therefore, e-values greater than 1 mean that you'd expect at least one alignment similar to what you've found by chance alone.
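
To see why E-values above 1 carry little weight, it can help to convert the expected count into a probability. The sketch below assumes chance hits follow a Poisson distribution, so the probability of at least one chance hit is P = 1 − e^(−E); the function name is hypothetical.

```python
import math


def pvalue_from_evalue(evalue: float) -> float:
    """Probability of at least one chance hit, assuming Poisson statistics."""
    # -expm1(-E) equals 1 - exp(-E) but stays accurate for very small E.
    return -math.expm1(-evalue)


for e in (1e-10, 0.01, 1.0, 3.0, 10.0):
    print(f"E = {e:g}  ->  P = {pvalue_from_evalue(e):.4g}")
```

For small E the two numbers are nearly equal, which is why E-values and P-values are often used interchangeably below about 0.01; once E exceeds 1, a chance hit of that quality is more likely than not.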

What does an E value of 0.0 represent? ›

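The E-value measures how likely the observed similarity is to arise by chance: the lower the E-value, the more significant the match between your query sequence and the retrieved sequence. A reported E-value of 0 means the value was too small to display and was rounded to zero. It usually accompanies a long, near-identical match, but it does not by itself prove that the match is exact.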

Is an E-value of 0 good? ›

An E-value of 0.0 means that essentially zero sequences are expected to match as well or better by chance; the closer the E-value is to zero, the more significant (and the less likely to be a false positive) the match is considered to be.

What is a lower E-value in BLAST? ›

E-value: Indicates the number of hits or alignments that are expected to be seen by random chance with the same score or better. The lower the E-value, the more significant the alignment (the closer to 0, the better).

What does an E-value of 3 or less represent? ›

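Within a database of a particular size, the E-value is the number of hits with the same or a better score that are expected to come up by chance. An E-value of 3 therefore means that about three such chance hits are expected, so a match at that level is weak evidence on its own. Only as the E-value drops well below 1 is there a good chance that the match is meaningful and not due to random chance.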

What does a positive E value mean? ›

This answer concerns electrochemistry, where E denotes a cell potential and is unrelated to the BLAST E-value. If the value of E°_cell is positive, the reaction will occur spontaneously as written. If the value of E°_cell is negative, the reaction is not spontaneous and will not occur as written under standard conditions; it will, however, proceed spontaneously in the opposite direction.

What is a good BLAST result? ›

BLAST results do not typically attempt to match the full length of a sequence. A high Query Cover value for the initial triage is in the 70%+ range. If the top results fall below this range, it would generally be a good idea to review the sequence more in the future, and not verify it as a part of your initial triage.
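
A triage step like this can be sketched as a filter over tabular hits. Everything specific here is an assumption for illustration: the four-column tab-separated layout (subject, E-value, bit-score, query cover in percent), the names `Hit`, `parse_hits` and `triage`, the 0.01 E-value cut-off, and the example rows, which are invented rather than real search results.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    subject: str
    evalue: float
    bitscore: float
    query_cover: float  # percent of the query covered by the alignment


def parse_hits(text: str) -> list[Hit]:
    """Parse tab-separated lines: subject, evalue, bitscore, query cover."""
    hits = []
    for line in text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and comments
        subject, evalue, bitscore, qcov = line.split("\t")
        hits.append(Hit(subject, float(evalue), float(bitscore), float(qcov)))
    return hits


def triage(hits: list[Hit], min_query_cover: float = 70.0,
           max_evalue: float = 0.01) -> list[Hit]:
    """Keep hits passing both cut-offs, best (lowest) E-value first."""
    kept = [h for h in hits
            if h.query_cover >= min_query_cover and h.evalue <= max_evalue]
    return sorted(kept, key=lambda h: h.evalue)


# Invented example rows (hypothetical data).
EXAMPLE = (
    "subjA\t1e-80\t300\t95\n"
    "subjB\t2e-5\t48.1\t40\n"
    "subjC\t0.5\t30.2\t88\n"
    "subjD\t3e-12\t71.0\t72\n"
)

for hit in triage(parse_hits(EXAMPLE)):
    print(hit.subject, hit.evalue, hit.query_cover)
```

Hits that fail the coverage cut-off are set aside rather than discarded in a real workflow, since a short but strong local match can still be informative.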

How to interpret BLAST results? ›

The list of hits starts with the best match (most similar). E-value: expected number of chance alignments; the smaller the E-value, the better the match. If the query sequence itself is in the database, it appears first in the list, since it has the best score.

What is the relationship between database size and E-value for hits with identical alignment score? ›

The E-value is directly proportional to the database size. Conceptually this is easy to understand: an alignment with a given score (205 bits in the source example) is more significant in a smaller database, because in a larger database there is a greater chance of picking up matches at random.
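
One practical consequence is that a larger database demands a higher bit-score to reach the same E-value cut-off. The sketch below assumes the approximation E = m × n × 2^(−S') (m query length, n database length, S' bit-score) solved for S'; the function name is hypothetical and raw rather than effective lengths are used.

```python
import math


def bits_needed(evalue_cutoff: float, query_len: int, db_len: int) -> float:
    """Bit-score required to reach evalue_cutoff: S' = log2(m * n / E)."""
    return math.log2(query_len * db_len / evalue_cutoff)


for db_len in (10**6, 10**7, 10**8, 10**9):
    needed = bits_needed(1e-3, 300, db_len)
    print(f"database length {db_len:>13,}: {needed:.1f} bits for E <= 1e-3")
```

Each tenfold increase in database length raises the required score by log2(10), a little over 3.3 bits.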

What is the formula for bit score? ›

The bit-score (S') is determined by the following formula: S' = (λ × S − ln K) / ln 2, where λ is the Gumbel distribution constant, S is the raw alignment score, and K is a constant associated with the scoring matrix.
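
A minimal sketch of this conversion follows. The default λ and K are illustrative values of the kind reported for gapped BLOSUM62 protein searches and are assumptions here; the real values depend on the scoring matrix and gap penalties and are printed in the footer of a BLAST report. The function names are hypothetical.

```python
import math

# Illustrative statistical parameters (assumed, not universal).
LAMBDA = 0.267
K = 0.041


def bit_score(raw_score: float, lam: float = LAMBDA, k: float = K) -> float:
    """Normalize a raw alignment score: S' = (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def raw_score(bits: float, lam: float = LAMBDA, k: float = K) -> float:
    """Invert the formula: S = (S' * ln 2 + ln K) / lambda."""
    return (bits * math.log(2) + math.log(k)) / lam


for s in (50, 100, 200):
    print(f"raw score {s:>3} -> {bit_score(s):.1f} bits")
```

Because λ and K are folded into the bit-score, bit-scores from searches with different scoring systems can be compared directly, which raw scores cannot.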

What is the E-value in sequence alignment? ›

The e-value represents the expectation of finding that sequence by random chance. So if you search a short sequence you are likely to have a lot more hits with high e-value (low significance), and if you search a long sequence you are likely to have fewer hits with lower e-value (greater significance).

What is the E-value and what is the significance of this value in an alignment? ›

The relevant statistic is called the Expect Value or e-value. Expect value — for a particular match, the number of chance alignments expected with the same score or a better one. The Expect value is an exponentially decreasing function of the score and is directly proportional to the search space.
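
Both properties can be read off the standard Karlin-Altschul form of the Expect value, written here as a sketch. The symbols are defined for this block: m is the query length, n the database length (their product is the search space), S the raw score, S' the bit-score, and λ and K the statistical parameters of the scoring system. BLAST uses effective rather than raw lengths for m and n.

```latex
E = K \, m \, n \, e^{-\lambda S} = m \, n \, 2^{-S'},
\qquad
S' = \frac{\lambda S - \ln K}{\ln 2}
```

The factor m n gives the direct proportionality to the search space, and the exponential term gives the rapid decrease with score: each additional bit halves E.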
