Converge Bio reposted this
I really liked the approach described by Hou et al. in their recent Cell Press publication: Using artificial intelligence to document the hidden RNA virosphere (the link to the paper is given in the first comment below). The authors' goal was to identify new RNA viruses from metatranscriptomes, which are RNA sequencing of non-isolated samples containing many different types of organisms. They combined two approaches - one was a classic sequence homology based bioinformatic search of a hallmark RNA virus gene (RdRP), this is the left branch in the figure below. The other was a language model allowing for capturing of more abstract similarity, such as structural similarity; this is the right branch in the figure below. I like this combination of classic and modern, where prior knowledge is used directly through classic tools, and amplified by new cutting edge GenAI tech. Nice work!