Anonymous ID: 98b53b Dec. 23, 2025, 5:44 p.m. No.24021691   🗄️.is 🔗kun   >>1714

>>24021538

(be afraid…. be VERY afraid)

 

Sequence modeling and design from molecular to genome scale with Evo

 

Editor’s summary

Large language models have great potential to interpret biological sequence data. Nguyen et al. present Evo, a multimodal artificial intelligence model that can interpret and generate genomic sequences at a vast scale. The Evo architecture leverages deep learning techniques, enabling it to process long sequences efficiently. By analyzing millions of microbial genomes, Evo has developed a comprehensive understanding of life’s complex genetic code, from individual DNA bases to entire genomes. This enables the model to predict how small DNA changes affect an organism’s fitness, generate realistic genome-length sequences, and design new biological systems, including laboratory validation of synthetic CRISPR systems and IS200/IS605 transposons. Evo represents a major advancement in our capacity to comprehend and engineer biology across multiple modalities and multiple scales of complexity (see the Perspective by Theodoris). —Di Jiang

 

Structured Abstract

INTRODUCTION

The fundamental instructions of life are encoded in the DNA sequences of all living organisms. Understanding these instructions could unlock deeper insights into biological processes and enable new ways to reprogram biology into useful technologies. However, even the simplest microbial genomes are incredibly complex, with millions of DNA base pairs encoding the interplay of DNA, RNA, and proteins—the three modalities of the so-called central dogma of molecular biology and the key actors in cellular function. This complexity exists at multiple scales, from individual molecules to whole genomes, representing a vast landscape of genetic information that has been functionally selected over evolutionary time.

RATIONALE

Rapid progress in artificial intelligence (AI) has led to large language models that demonstrate increasingly advanced multitask reasoning and generation capabilities when trained on massive amounts of data. However, technological limitations in the architecture of these models have restricted efforts to apply them to biology at a similar scale. Current approaches struggle to analyze sequences at the individual character level and are computationally demanding when applied to long sequences. An advanced model maintaining single-nucleotide resolution over large genomic sequences could potentially extract functional information about the complex molecular interactions that are embedded in the patterns of natural evolutionary variation.

RESULTS

In this work, we present Evo, a genomic foundation model that enables prediction and generation tasks from the molecular to the genome scale. Using an architecture based on advances in deep signal processing, we scaled Evo to 7 billion parameters with a context length of 131 kilobases at single-nucleotide resolution. We report scaling laws on DNA, complementing similar observations in natural language and vision. Trained on 2.7 million prokaryotic and phage genomes, Evo demonstrates zero-shot function prediction across DNA, RNA, and protein modalities that is competitive with—or outperforms—domain-specific language models. Evo also excels at multimodal generation tasks, which we demonstrated by generating synthetic CRISPR-Cas molecular complexes and transposable systems. We experimentally validated the functional activity of Evo-generated CRISPR-Cas molecular complexes as well as IS200 and IS605 transposable systems, representing the first examples of protein-RNA and protein-DNA codesign with a language model. Using information learned over whole genomes, Evo learns how small changes in nucleotide sequence affect whole-organism fitness and can generate DNA sequences with plausible genomic architecture more than 1 megabase in length.

CONCLUSION

Evo is a foundation model that is designed to capture two fundamental aspects of biology: the multimodality of the central dogma and the multiscale nature of evolution. The central dogma integrates DNA, RNA, and proteins with a unified code and predictable information flow, whereas evolution unifies the vastly different length scales of biological function represented by molecules, pathways, cells, and organisms. Evo learns both of these representations from the whole-genome sequences of millions of organisms to enable prediction and design tasks from the molecular to genome scale. Further development of large-scale biological sequence models like Evo, combined with advances in DNA synthesis and genome engineering, will accelerate our ability to engineer life.

 

https://www.science.org/doi/10.1126/science.ado9336

 

full paper available for dld (registration req'd)

Anonymous ID: 98b53b Dec. 23, 2025, 5:52 p.m. No.24021714   🗄️.is 🔗kun   >>1715

>>24021538

>>24021691 (me)

 

==Genome modeling and design across all domains of life with Evo 2==

 

Abstract

All of life encodes information with DNA. While tools for sequencing, synthesis, and editing of genomic code have transformed biological research, intelligently composing new biological systems would also require a deep understanding of the immense complexity encoded by genomes. We introduce Evo 2, a biological foundation model trained on 9.3 trillion DNA base pairs from a highly curated genomic atlas spanning all domains of life. We train Evo 2 with 7B and 40B parameters to have an unprecedented 1 million token context window with single-nucleotide resolution. Evo 2 learns from DNA sequence alone to accurately predict the functional impacts of genetic variation, from noncoding pathogenic mutations to clinically significant BRCA1 variants, without task-specific finetuning. Applying mechanistic interpretability analyses, we reveal that Evo 2 autonomously learns a breadth of biological features, including exon–intron boundaries, transcription factor binding sites, protein structural elements, and prophage genomic regions. Beyond its predictive capabilities, Evo 2 generates mitochondrial, prokaryotic, and eukaryotic sequences at genome scale with greater naturalness and coherence than previous methods. Guiding Evo 2 via inference-time search enables controllable generation of epigenomic structure, for which we demonstrate the first inference-time scaling results in biology. We make Evo 2 fully open, including model parameters, training code, inference code, and the OpenGenome2 dataset, to accelerate the exploration and design of biological complexity.

 

Introduction

Biological research spans scales from molecules to systems to organisms, seeking to understand and design functional components across all domains of life (Darwin, 1859; Mendel, 1866; Dobzhansky, 1951). Creating a machine to design functions across the diversity of life would require it to learn a deep, generalist representation of biological complexity. While this complexity surpasses straightforward human intuition, advances in artificial intelligence offer a universal framework that leverages data and compute at scale to uncover higher-order patterns (Vaswani et al., 2017; Kaplan et al., 2020). We reasoned that training a model with these capabilities would require data spanning the full spectrum of biological diversity to discover emergent properties similar to those found in other fields (Radford et al., 2019).

All domains of life express complex functions from DNA sequences (Watson and Crick, 1953; Nirenberg and Matthaei, 1961), yet genomic content and length vary dramatically across organisms. Prokaryotic genomes maintain relatively simple organization (Jacob and Monod, 1961; Overbeek et al., 1999), while eukaryotic evolution has produced intricate genomic architectures characterized by extensive noncoding regions, alternative splicing patterns, and multiple layers of epigenomic control (Chow et al., 1977; Brownell et al., 1996). These features underpin the emergence of multicellularity, sophisticated traits, and intelligent behaviors that are unique to eukaryotic life (Szathmáry and Smith, 1995).

We previously demonstrated that machine learning models trained on prokaryotic genomic sequences can model the function of DNA, RNA, and proteins, as well as their interactions that create complex molecular machines (Nguyen et al., 2024a; Merchant et al., 2024). However, extending this sequence modeling paradigm to eukaryotic genomes would require advances in data curation, model architecture, training and inference infrastructure, and inference-time compute to address the scale and complexity of eukaryotic genomes.

 

1/2

Anonymous ID: 98b53b Dec. 23, 2025, 5:53 p.m. No.24021715   🗄️.is 🔗kun

>>24021714

 

2/2

 

Here we present Evo 2, a biological foundation model that is trained on a representative snapshot of genomes spanning all observed evolution. Emphasizing generalist capabilities over task-specific optimization, Evo 2 achieves robust prediction and generation performance from molecular to genome scale and across all domains of life. We trained two versions of Evo 2 at 7B and 40B parameters, leveraging over 9.3T tokens at single-nucleotide resolution. These models were trained with a context window up to 1M tokens and demonstrate effective retrieval across the full context. To enable the research community, we release, to our knowledge, the largest-scale fully open language model to date, including open-source training code, inference code, model parameters, and the OpenGenome2 training data.

Evo 2 exhibits strong performance across biological sequence tasks. Building upon our previous work (Nguyen et al., 2024a), Evo 2 learns how mutations affect protein, RNA, and organismal fitness, while now generalizing beyond prokaryotes to include humans, plants, yeast, and other eukaryotes. Remarkably, without any variant-specific training, architectural optimization, or multiple sequence alignments, Evo 2 is the first language model capable of scoring the impact of all variant types on pathogenicity and splicing, achieving accurate and state-of-the-art performance in predicting the pathogenic effects of noncoding variation. Furthermore, a supervised model built on Evo 2 embeddings attains state-of-the-art performance on classifying BRCA1 variants of unknown significance in breast cancer.

To elucidate the model’s learned concepts, we applied mechanistic interpretability techniques that decompose large language model representations into understandable components (Cunningham et al., 2023; Bricken et al., 2023). Using sparse autoencoders (SAEs), we identified a diverse set of features corresponding to key biological signatures, including intron and exon boundaries, transcription factor motifs, and protein structure characteristics. These feature-based annotations can also be leveraged for discovery tasks, such as identifying prophage regions and mobile genetic elements.

Evo 2 can also leverage its unique representation of biological complexity to generate new genomic sequences. We first demonstrate unconstrained generation of genome- and chromosome-scale sequences with improved naturalness compared to previous genomic language models. This includes the ability to generate complete mitochondrial genomes, minimal bacterial genomes, and entire yeast chromosomes. We also demonstrate how inference-time search can guide generation with Evo 2 to successfully achieve complex design tasks. In particular, we demonstrate controllable generation by using models of epigenomic state to design novel DNA sequences for which we can specify the location and length of chromatin-accessible regions, allowing us to write simple Morse code messages into our epigenomic designs. In doing so, we demonstrate the first inference-time scaling results for biological language modeling, extending our previous work that showed the first scaling laws for DNA sequence pretraining.

Evo 2 and future iterations of the DNA foundation modeling paradigm represent the first steps toward generative biology for genomic and epigenomic design. This computational ability, combined with our recent experimental advances in large-scale programmable DNA manipulation (Durrant et al., 2024), may enable the direct programming of diverse synthetic life. Furthermore, combined with application-specific scoring functions to provide inference-time guidance, Evo 2 enables the design of complex biological architecture beyond DNA alone.

 

full paper available at…

https://arcinstitute.org/manuscripts/Evo2

 

(be afraid… be VERY afraid)

Anonymous ID: 98b53b Dec. 23, 2025, 6:04 p.m. No.24021751   🗄️.is 🔗kun   >>1883

>>24021739

>Hate to break it to ya, but we are approx 2 moar weeks away

scientists have determined

that due to relativistic time dilation effects

two moar weeks will not begin

until approx two moar weeks from now

Anonymous ID: 98b53b Dec. 23, 2025, 6:19 p.m. No.24021791   🗄️.is 🔗kun   >>1802

>>24021787

>https://www.health.harvard.edu/staying-healthy/blue-light-has-a-dark-side

LMFAO….

all of a sudden, harvard becomes a "TRUSTED SOURCE?"

tell me, einstein, how much "blue light" is there in direct sunlight?

stop now while you've only made an ass of yourself

otherwise, you risk making a COLOSSAL ASS of yourself

Anonymous ID: 98b53b Dec. 23, 2025, 6:24 p.m. No.24021806   🗄️.is 🔗kun   >>1827

>>24021792

>Technology exists for the sole purpose of their transhumanist agenda

go live in a cave, then, asshat

no clothes, no shoes, no tools

catch rabbits with your bare hands

and fish in your teeth

eat 'em all raw

and wipe your dysentery ass with your fingers

imbecile

Anonymous ID: 98b53b Dec. 23, 2025, 6:30 p.m. No.24021823   🗄️.is 🔗kun   >>1826

>>24021816

>confuses color of sky with spectrum of sunlight

14kB limit prevents me from rubbing your face in your arrogant stupidity

i can show you 1000 published articles that "prove" cigarettes are not dangerous, and even CURE tuberculosis

READ A FUCKING BOOK, ASSHAT

when you're done, READ 1000 MORE

Anonymous ID: 98b53b Dec. 23, 2025, 6:36 p.m. No.24021835   🗄️.is 🔗kun   >>1841 >>1844

>>24021829

>LED's permanently changed to a red color that my wife swears feels like microwaves

LMFAO…..

cheap LEDs are permanently damaged by voltages above OR below their design specs

btw… how does your wife know what microwaves feel like?

can she feel them coming out of her cellphone?

or the wifi router in your house?

or dija stick her head in the microwave oven when she refused to make you a sammich?

Anonymous ID: 98b53b Dec. 23, 2025, 6:39 p.m. No.24021842   🗄️.is 🔗kun   >>1848

>>24021826

>Tell me, what cells are in the eye, mr. expert.

i'd say you have neuroanalretinopathy… a nerve that runs from your eyeball to your asshole, and gives you a shitty outlook on reality

Anonymous ID: 98b53b Dec. 23, 2025, 6:45 p.m. No.24021861   🗄️.is 🔗kun   >>1873

>>24021848

>questioning technobabble jargonspeak BS make me a socialist

sure thing, sunny jim

say…

ya still haven't told us the intensity of blue light contained in direct sunlight

whattsamatta?

doan unerstan da qwestshun?

Anonymous ID: 98b53b Dec. 23, 2025, 6:50 p.m. No.24021881   🗄️.is 🔗kun   >>1906

>>24021877

>I have exposed a plethora of things, that were initially mocked, only to be eventually proven right.

oh sure…

i think EVERYONE reading your thread understands THAT you are unquestionably highest-ranking-anon

Anonymous ID: 98b53b Dec. 23, 2025, 6:53 p.m. No.24021893   🗄️.is 🔗kun   >>1903 >>1912

>>24021873

>you claimed to have watched a 50 minute video within 2 minutes

i NEVER claimed that

i said your shit is old

been around for decades

was debunked when you were fapping to bra commercials in your mommy's basement

Anonymous ID: 98b53b Dec. 23, 2025, 6:56 p.m. No.24021904   🗄️.is 🔗kun   >>1922

>>24021890

>The exploding pagers were using 5G tech not explosives

damn…

i know of no other place where i can get entertainment like this, at ANY price

and here you are, making anons ROTFLTAO for free

you da man!

Anonymous ID: 98b53b Dec. 23, 2025, 7 p.m. No.24021918   🗄️.is 🔗kun

>>24021901


>I didn't know that's what STEM stood for, too bad someone like you wasn't around to let me know.

if you HAD an actual STEM degree, you'd have a job that would repay your student loan in 20-30 yrs

after that, you could get one of fake trump's 50-yr mortgages, and buy a one-room house in a smart city

Anonymous ID: 98b53b Dec. 23, 2025, 7:05 p.m. No.24021933   🗄️.is 🔗kun   >>1948

>>24021922

>I was the one who warned about the toxic vaccines, mRNA, riots, a fake pandemic, the arrayed gangs, social services and social security fraud, USAID, and 911 was the beginning of a financial war to destroy the dollar over BRIICS, and way more.

 

well NEXT time you should SIGN your work with your ACTUAL NAME

so everyone can bow and give you the credit you so richly deserve

Anonymous ID: 98b53b Dec. 23, 2025, 7:20 p.m. No.24021977   🗄️.is 🔗kun

>>24021966

>Here is a case in point, my wife warned someone at her church a while back about certain things, which was scoffed at, but unfortunately, they paid the price for not taking heed.

do you even understand what "case in point" means?

it DOESN'T mean referring to "certain things"

what "church" WAS that?

what were those "certain things?"

who were "they" and what "price" did they pay?

Anonymous ID: 98b53b Dec. 23, 2025, 7:37 p.m. No.24022031   🗄️.is 🔗kun   >>2048 >>2074

>>24021994

>American students were set up, degrees were devalued, debts were increased

RIGHT….

you're just another VICTIM

you had ZERO responsibility to find out for yourself what a degree in psychology was actually worth?

did you check the want ads in the sunday paper to see if there were any jobs that actually REQUIRE the degree you were considering?

did you check the alumni association of the school you were considering to see if the grads were all posting success stories?

did you consider all the alternatives that DO lead to good paying jobs, like trade schools?

or were you led like a sheep to the shears?

you got what you DESERVED

sry…. NOT SRY

Anonymous ID: 98b53b Dec. 23, 2025, 7:52 p.m. No.24022065   🗄️.is 🔗kun   >>2117

>>24022024

>They don’t, we do!

60 million deer hunters

deadly marksmen skilled in camouflage and stalking

armed with patience, hundreds of millions of rifles, and hundreds of billions of rounds of ammunition

waiting for a "go" command

the target list is already known

Anonymous ID: 98b53b Dec. 23, 2025, 8:02 p.m. No.24022097   🗄️.is 🔗kun

>>24022074

>Do you just come here to rant your script?

do you just come here to whine and blame everyone but yourself?

 

how much did your little bo peep diploma cost?

how much does the job you imagined pay?

how much could you earn WITHOUT that diploma?

does the DIFFERENCE between reality and fantasy justify the cost of the diploma?

what is the downside if things don't work out exactly like my pie-in-the-sky fantasy?

these are really OBVIOUS questions

did you ask yourself any of them?

or were you too busy imagining a glamorous life in a cubicle where you never had to bust a sweat?

GTFOH, and take the stench of self-pity with you

Anonymous ID: 98b53b Dec. 23, 2025, 8:08 p.m. No.24022122   🗄️.is 🔗kun   >>2125 >>2129 >>2135

>>24022104

>Let that tard pay for all of the feral cats in the boonies to get neutered….

or just LEAVE THEM ALONE

nowhere else in the world do people obsess with feral cat populations

they breed until they reach a balance with the available food sources

just LEAVE THEM THE FUCK ALONE, you bloodlusting degenerate assfuck

Anonymous ID: 98b53b Dec. 23, 2025, 8:17 p.m. No.24022142   🗄️.is 🔗kun   >>2150

>>24022129

>they eat them

only chinks and gooks

one more reason to glass 'em

european cities have large feral cat populations

at least until the illegals arrived

 

FYI, during the bubonic plague, millions of cats were murdered bcs the illiterate tards of the day believed they were demons

actually, the plague was spread by fleas carried by rats…

rats that WOULD have been killed by the cats if the superstitious tards hadn't killed them all

Anonymous ID: 98b53b Dec. 23, 2025, 8:20 p.m. No.24022148   🗄️.is 🔗kun   >>2155 >>2165 >>2175

>>24022135

>The real world, and real feral cats, just doesn't work that way

how would you know if you kill them all?

i live in the boonies

feral cats are everywhere

same as feral dogs

and if ya had a BRAIN, you'd realize they got along just fine for MILLIONS of yrs before sick bastards like YOU showed up

you kill 'em bcs you LIKE IT

Jesus must be SO PROUD of you

Anonymous ID: 98b53b Dec. 23, 2025, 8:23 p.m. No.24022152   🗄️.is 🔗kun   >>2154

>>24022132

>Pretty sure cats aren't "brethren" dude.

pretty sure they are

maybe READ moar

animals are God's innocents

and they existed long before self-important assfucks like you came along

Anonymous ID: 98b53b Dec. 23, 2025, 8:25 p.m. No.24022157   🗄️.is 🔗kun   >>2168

>>24022150

>Do you keep cats in your house?

yes

and there are wild ones in the woods, too

none of them NEED culled

the only thing that needs culled are bloodlusting closet faggots who need guns bcs their penis is soft

Anonymous ID: 98b53b Dec. 23, 2025, 8:27 p.m. No.24022167   🗄️.is 🔗kun   >>2179

>>24022155

>You live in the boonies where feral dogs and feral cats have been roaming free for millions of years?

where did you THINK cats and dogs lived before humans came on the scene?

or do you believe in fairytales?

Anonymous ID: 98b53b Dec. 23, 2025, 8:50 p.m. No.24022235   🗄️.is 🔗kun   >>2243

all you sicko bastards championing the rando killing of stray cats…

even muzzies are more civilized than you

 

https://www.youtube.com/results?search_query=feral+cats+in+istanbul

 

now type in the name of any other european city, you will get identical vids

cats do NOT need to be culled

wish i could be there to see your faces on judgement day when you try to justify your bloodlust to God

Anonymous ID: 98b53b Dec. 23, 2025, 9:08 p.m. No.24022289   🗄️.is 🔗kun   >>2315

>>24022271

>Does anyone remember this? Had one of these. They remind me of a time when we used to make quality precision products in America, before we were copied by cheap plastic Chinese knockoffs.

those ARE plastic, you fuckwit

there was NOTHING quality OR precision about them

a bundle of plastic fibers stuck to the top of a light bulb

wooooooooo….. so quality….. so precision……

fiberoptic fibers ARE 100% plastic

Anonymous ID: 98b53b Dec. 23, 2025, 9:12 p.m. No.24022294   🗄️.is 🔗kun   >>2301

>>24022278

>ZIONIST ALERT!!!!

brigitte macron (jean-michel de rothschild) is the HEAD of the zionists in europe, you absolute fucking retard

>(You) are the one trying to discredit CO for exposing him/her/it