Similaridade entre sequências
- Created by
- Renato Passos, Eng. de Software
- Reviewed by
- Renato Passos, Eng. de Software
Last updated: Apr 18, 2026
About this calculator
The sequence similarity calculator is a tool used to compare two character sequences, such as DNA, RNA, or protein sequences. It calculates the similarity between sequences based on the number of matches (identical characters) and the total number of characters compared. The formula used is sim = (matches/total)·100, which provides similarity as a percentage.
This calculator works by comparing character by character the two input sequences and counting the number of matches (coincidences). Then, it divides the number of matches by the total number of characters compared and multiplies by 100 to obtain the similarity percentage. This method is widely used in bioinformatics and phylogenetics to analyze the evolutionary relationship between different organisms.
Sequence similarity is an important measure in phylogenetics, as it helps to infer the evolutionary history of species. The higher the similarity between sequences, the higher the probability that the species are evolutionarily close. However, it is essential to be aware of the limitations of the method, such as the possibility of evolutionary convergence, where unrelated sequences may exhibit similarity due to similar selective pressures.
When using this calculator, it is crucial to consider the quality of the input sequences and the possibility of sequencing errors. Additionally, it is essential to remember that sequence similarity is not the only criterion for inferring evolutionary relationships, and that other factors, such as the presence of gaps and regional variability, should also be considered.
Frequently asked questions
What is sequence similarity?
Sequence similarity is a measure that expresses the degree of similarity between two character sequences, such as DNA, RNA, or proteins.
How is similarity calculated?
Similarity is calculated based on the number of matches (identical characters) and the total number of characters compared, using the formula sim = (matches/total)·100.
What are the applications of sequence similarity?
Sequence similarity is widely used in bioinformatics and phylogenetics to analyze the evolutionary relationship between different organisms.
What are the limitations of the method?
Limitations include the possibility of evolutionary convergence and the need to consider the quality of the input sequences and other factors, such as the presence of gaps and regional variability.