Copypasta Quote from PseudoDave
"...it's taken out of context and incomplete. Also, frankly, such a short sequence is going to come up by random chance many, many times in nature.
Here is the full list, excluding SARS-CoV-2:
[link to blast.ncbi.nlm.nih.gov (secure)]
In short, the sequence appears 100% identical about 40 times, and near identical well over 100 times. Since bacterial diversity is crazy, probably a billion times that of animals, it's expected to show up a lot in bacteria by random chance, and it means absolutely nothing.
Should also point out that the nucleic acid sequence, CGTA, is pretty irrelevant. It's the protein-coding sequence that matters when discussing proteins. So that makes this even dumber.
Quick rundown on what this is and how to read it; I do this for a living.
You enter a DNA sequence, and it scans all sequenced genomes available in the database, from bacteria to cows, to random unknown stuff found in oceans.
It kicks back scores on how closely the DNA sequence matches. Coverage is how much of your sequence lines up with the hit, e.g. 50% coverage means only half of your sequence aligned. Percent identity is how close the match is within that covered range. So 100% coverage at 100% identity is completely identical. The scores come from their scoring algorithm; higher = better. The description is the name of the sample/life form that has the DNA, and the accession number is the database location.
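To make coverage vs. identity concrete, here is a rough Python sketch. It is not how BLAST works internally; it just applies the two definitions above to an alignment you already have in hand. The function name and the example sequences are made up for illustration, and gaps are assumed to be written as "-".

```python
def coverage_and_identity(query, aligned_query, aligned_subject):
    """Return (query coverage %, percent identity %) for one alignment.

    query           -- the full sequence that was searched with
    aligned_query   -- the part of the query that aligned (may contain '-')
    aligned_subject -- the matching part of the database hit, same length
    """
    if len(aligned_query) != len(aligned_subject):
        raise ValueError("aligned strings must have the same length")
    if not query or not aligned_query:
        raise ValueError("sequences must not be empty")

    # Identity: matching columns divided by alignment length.
    matches = sum(
        1 for q, s in zip(aligned_query, aligned_subject) if q == s and q != "-"
    )
    identity = 100.0 * matches / len(aligned_query)

    # Coverage: how many query bases took part in the alignment.
    query_bases_aligned = sum(1 for q in aligned_query if q != "-")
    coverage = 100.0 * query_bases_aligned / len(query)

    return coverage, identity


# Only the first half of a 10-base query aligns, but it matches perfectly.
print(coverage_and_identity("ACGTACGTAC", "ACGTA", "ACGTA"))
# The whole query aligns, with one mismatch at the last base.
print(coverage_and_identity("ACGTACGTAC", "ACGTACGTAC", "ACGTACGTAT"))
```

The first call is the "50% coverage at 100% identity" case; the second is full coverage with slightly lower identity.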
Generally, we use BLAST to figure out what a gene is or where it came from, to track evolution, or to find similarly functioning organisms. So normally you enter 1000+ DNA bases, not 19 nucleotides. As there are only 4 DNA bases, every genome is made up of those 4 in different orders. As a normal bacterium has about 4,000,000 bases, and there are millions of different bacteria, the chance it comes up by random chance is crazy high."
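The back-of-envelope math behind that last point can be sketched in a few lines of Python. This assumes every base is equally likely and independent of its neighbours, which real genomes are not, so treat it as an order-of-magnitude estimate only. The genome size is the round number from the quote; the count of 1,000,000 genomes is an assumption standing in for "millions of different bacteria", not the actual size of the BLAST database. The function name is made up for illustration.

```python
def expected_random_hits(k, genome_length, n_genomes, both_strands=True):
    """Expected number of exact matches to one specific k-base sequence,
    assuming uniformly random, independent bases (a simplifying assumption)."""
    # Chance that one given position starts an exact match: (1/4)^k.
    p_match = 0.25 ** k
    # Number of start positions in one genome where a k-mer fits.
    positions = max(genome_length - k + 1, 0)
    # A match can sit on either DNA strand.
    if both_strands:
        positions *= 2
    return p_match * positions * n_genomes


# 19 bases, ~4,000,000-base genome, assumed 1,000,000 genomes.
hits = expected_random_hits(k=19, genome_length=4_000_000, n_genomes=1_000_000)
print(f"expected exact matches by chance: {hits:.1f}")
```

With these round numbers the estimate lands at roughly 29 exact matches from chance alone, the same ballpark as the roughly 40 identical hits mentioned above.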