Origin-1 논문 리뷰

들어가며

Zero-prior epitope 설정

Platform 구성: AbsciDiff, IgDesign2, AbsciBind

Search와 filtering cascade

In silico benchmark와 filtering score

Experimental target panel 결과

Lead optimization의 역할

Developability와 specificity evidence

Structural validation과 IL36RA function

실패 target을 같이 읽기

평가: full-length mAb design benchmark로서의 Origin-1

참고

Origin-1 논문 리뷰

들어가며

Antibody design을 general protein binder design과 같은 문제로 보면 중요한 부분을 놓치기 쉽습니다. Miniprotein binder는 비교적 열린 scaffold 공간에서 target surface에 맞는 작은 단백질을 찾는 문제에 가깝습니다. Full-length monoclonal antibody는 다릅니다. Heavy/light chain framework가 있고, CDR loop가 있고, epitope 접근 geometry가 있고, developability와 functional assay가 붙습니다.

Origin-1은 이 차이를 전면에 둔 bioRxiv preprint입니다. Absci / AbSciBio가 발표한 “Origin-1: a generative AI platform for de novo antibody design against novel epitopes”는 full-length mAb를 대상으로, 알려진 antibody-antigen complex나 protein-protein complex 구조가 없는 target site를 겨냥하는 zero-prior epitope design을 주장합니다. 이 글에서는 Origin-1을 open antibody design tool이라기보다, company-led full-length mAb design platform이 어떤 experimental evidence를 제시했는지 보는 review로 다루겠습니다.

이 논문은 headline만 보면 높은 성능의 항체 설계 논문처럼 보일 수 있습니다. 하지만 실제로는 더 조심스럽게 읽어야 하는 균형을 가집니다. 10개 design targets 중 4개에서 SPR/BLI/aSEC/developability/cryo-EM/function으로 이어지는 evidence가 나오고, 6개 target에서는 hit을 찾지 못했습니다. Parent hits도 대부분 micromolar 또는 weak binder이고, 가장 높은 affinity와 function은 lead optimization 이후에 나옵니다. 그래서 Origin-1은 “한 번에 항체를 잘 만든다”는 이야기보다, hard zero-prior epitope setting에서 weak parent hit을 찾고 optimization으로 끌어올리는 integrated platform story에 가깝습니다.

Zero-prior epitope 설정

Origin-1이 잡은 문제는 단순 CDR redesign이 아닙니다. 저자들은 zero-prior epitope를 target site에 대해 공개된 antibody-antigen complex나 protein-protein complex 구조가 없는 경우로 정의합니다. Target selection에서도 PDB에 antibody 또는 다른 protein과 complex가 없는 target을 고르고, protein-protein complex가 알려진 protein과도 sequence identity가 60% 이하가 되도록 제한했다고 설명합니다.

이 설정은 antibody design에서 어렵습니다. 이미 알려진 binder epitope나 native complex를 재사용하는 문제와 달리, 어디에 어떻게 접근할지부터 모델과 filtering이 결정해야 하기 때문입니다. Epitope residue를 지정하더라도, full-length mAb framework 안에서 그 epitope에 맞는 CDR geometry를 만드는 문제는 별도입니다.

다만 zero-prior를 “아무 prior가 없다”로 읽으면 안 됩니다. 여기서 prior가 없다는 말은 주로 public structural complex precedent가 없다는 뜻입니다. Target biology, surface geometry, chemical property, functional site, reagent availability 같은 정보는 여전히 target/epitope selection에 영향을 줄 수 있습니다. 따라서 Origin-1의 zero-prior setting은 blind target discovery라기보다, structural precedent가 없는 epitope를 겨냥한 full-length mAb design benchmark로 읽는 편이 안전합니다.

이 때문에 Origin-1의 결과는 hit rate 하나로만 보면 부족합니다. Target-level success, parent hit affinity, lead optimization, structural pose validation, developability, function을 나눠 읽는 편이 안전합니다. 특히 epitope targeting과 paratope geometry는 별도 evidence layer로 읽는 편이 안전합니다. Intended epitope를 지정했다는 것과, 실제 full-length mAb가 설계한 pose로 그 epitope에 결합했다는 것은 다른 evidence layer입니다.

Platform 구성: AbsciDiff, IgDesign2, AbsciBind

Origin-1은 세 모듈로 구성됩니다. AbsciDiff는 Boltz-1 checkpoint에서 fine-tune한 diffusion-based all-atom generative model입니다. Antigen structure, antibody framework, epitope residues를 condition으로 받고, CDR positions와 inter-chain pair features를 mask해 antibody-antigen complex structure를 생성합니다.

IgDesign2는 structure-conditioned paired CDR sequence design model입니다. PiFold-style GNN encoder, causal transformer decoder, frozen paired antibody language model IgBert refinement module을 결합합니다. Heavy/light chain을 따로 처리하는 일반 sequence design과 달리, paired antibody representation을 CDR sequence design에 넣으려는 시도입니다.

AbsciBind는 co-folding-based antibody-antigen scoring protocol입니다. AbsciBind Score는 default AbsciBind ipTM과 Antibody-Aligned ipTM의 평균으로 정의되고, design filtering과 lead optimization variant selection에 사용됩니다.

이 구성에서 볼 지점은 Origin-1이 generator-only paper가 아니라는 것입니다. AbsciDiff가 후보 구조를 만들고, IgDesign2가 paired CDR sequence를 설계하며, AbsciBind가 후보를 scoring/filtering합니다. 이후 SPR/BLI/aSEC/developability/cryo-EM/function으로 이어집니다. 따라서 결과는 model-only success가 아니라 generation, sequence design, filtering, assay가 결합된 pipeline-level evidence로 읽는 편이 안전합니다.

Search와 filtering cascade

Origin-1의 practical pipeline은 단순히 후보를 한 번 생성하고 끝나는 구조가 아닙니다. Supplementary figure 흐름에서는 Wide Structure Search, Deep Structure Search, Sequence Search처럼 compute를 점점 좁혀가며 score가 높은 후보로 concentration하는 방식이 제시됩니다. 이 단계들은 generated complex structure, CDR sequence, co-folding score를 함께 쓰는 search-and-filter cascade에 가깝습니다.

이 부분은 denominator를 해석할 때 중요합니다. Experimental denominator는 “모델이 생성한 전체 후보”가 아니라, 여러 단계의 구조 생성, sequence design, scoring, filtering을 통과해 실제 SPR/BLI로 간 후보입니다. Origin-1의 hit rate는 raw generation hit rate가 아니라 post-filter handoff result입니다.

이 cascade는 antibody design에서 자연스러운 선택입니다. Full-length mAb는 framework, paired heavy/light CDR, epitope 접근 각도, developability가 동시에 걸리기 때문에 raw sampling만으로는 실험 가능한 후보를 고르기 어렵습니다. 다만 그만큼 결과 해석도 pipeline-level로 두는 편이 안전합니다. AbsciDiff, IgDesign2, AbsciBind 중 하나의 단독 성능으로 환원하면 논문의 claim이 흐려집니다.

In silico benchmark와 filtering score

논문은 AbsciGen이 RFantibody보다 in silico benchmark에서 더 높은 AbsciBind Score 후보를 만든다고 보고합니다. Ten zero-prior-style targets에서 mean AbsciBind Score는 RFantibody 0.218 ± 0.092, AbsciGen 0.398 ± 0.129입니다. AbsciBind Score ≥ 0.5인 design 비율도 RFantibody 0.80%, AbsciGen 23.50%로 제시됩니다.

이 결과는 filtering score 관점에서는 의미가 있습니다. 하지만 AbsciBind Score는 wet-lab affinity나 specificity가 아닙니다. Co-folding confidence와 interface plausibility를 반영하는 proxy에 가깝습니다. 실제 design claim의 무게는 뒤쪽 experimental validation에서 결정됩니다.

Origin-1 리뷰에서 이 구분은 중요합니다. In silico benchmark가 높아졌다고 해서 full-length antibody hit가 자동으로 나온다는 뜻은 아닙니다. 실제로 experimental section에서는 10개 target 중 4개에서만 hit이 나옵니다. Proxy score는 candidate selection을 도울 수 있지만, hit discovery와 같은 것은 아닙니다.

Experimental target panel 결과

Origin-1은 10개 design targets로 experimental validation을 수행합니다. Positive hit이 나온 target은 COL6A3, AZGP1, CHI3L2, IL36RA 네 개입니다. FOLR1, FOLR2, CLEC4A, AMBP, ALCAM, KLK1 여섯 target에서는 hit을 찾지 못했습니다. TL1A와 PRLR은 off-target specificity controls로 사용됩니다.

Main text 기준 SPR hit은 COL6A3 3개, AZGP1 4개, CHI3L2 1개, IL36RA 1개입니다. 이후 BLI에서는 mAb format 기준 COL6A3 1개와 AZGP1 2개가 확인됩니다. CHI3L2와 IL36RA는 BLI confirmation이 약해서 low-affinity hit으로 보고 lead optimization으로 넘어갑니다.

GitHub repository의 strict Binder label은 더 보수적입니다. Successful-target 중심 CSV에서 AZGP1 2/95, COL6A3 1/95, CHI3L2 1/95, IL36RA 1/94처럼 표기됩니다. Main text의 broader SPR-identified hit count와 repository label이 완전히 같은 denominator를 쓰는 것은 아닌 것으로 보입니다. Publication-facing하게는 target-level 4/10 success와 per-target 약 94–100 design screening을 함께 적고, main text hit count와 repository strict label을 섞지 않는 편이 안전합니다.

이 결과의 해석은 두 층으로 나뉩니다. 첫째, zero-prior-like full-length mAb setting에서 4개 target에서 parent hit을 찾은 것은 의미 있는 result입니다. 둘째, 동시에 6개 target에서는 hit을 찾지 못했고, 성공 target에서도 parent hits는 대체로 weak합니다. Origin-1은 broad one-shot success라기보다, 일부 target에서 downstream optimization 가능한 starting point를 찾은 사례입니다.

Lead optimization의 역할

Origin-1에서 가장 조심해서 읽어야 할 부분은 parent hit와 optimized hit의 차이입니다. Parent affinities는 대체로 micromolar range입니다. Repository 기준 IL36RA parent KD는 6066.7 nM, COL6A3 parent KD는 1940 nM, CHI3L2 parent KD는 3460 nM, AZGP1 parent KD는 1420 nM 또는 4500 nM 수준입니다.

First-round lead optimization은 parent design당 94 single-mutant variants를 만들고, ESM 계열 language model과 AbsciBind Score를 함께 사용해 variants를 고릅니다. 이 단계에서 IL36RA는 89.425 nM, COL6A3는 174 nM, CHI3L2는 880 nM, AZGP1는 193/540 nM 수준까지 좋아집니다.

Second-round optimization에서는 IL36RA에서 14개 sub-nanomolar binders, COL6A3에서 3개 sub-nanomolar binders를 얻었다고 보고합니다. CSV에서 확인되는 최저 KD는 IL36RA human KD 0.381 nM, mouse KD 3.135 nM, COL6A3 KD 0.396 nM입니다.

이 흐름은 Origin-1의 성격을 잘 보여줍니다. 가장 높은 affinity claim은 one-shot de novo parent design에서 바로 나온 것이 아니라, AI-guided lead optimization 이후에 나옵니다. 따라서 Origin-1은 de novo hit discovery와 lead optimization을 분리하지 않으면 과대평가하기 쉽습니다.

Developability와 specificity evidence

Origin-1은 selected binders에 대해 developability panel을 수행합니다. Panel에는 polyreactivity, polydispersity, self-association, hydrophobicity, melting temperature, purity가 포함됩니다. COL6A3, CHI3L2, IL36RA parent binders는 대체로 기준을 통과했고, IL36RA는 hydrophobicity flag 하나가 있었습니다. AZGP1 top binder는 self-association, hydrophobicity, polyreactivity flag가 있었습니다.

Table 4에서 parent binders의 melting temperature는 COL6A3 Tm1 68.0°C, AZGP1 77.86°C, IL36RA 70.6°C, CHI3L2 70.2°C로 제시됩니다. Optimized IL36RA variants도 Table 5에서 early developability profile을 유지하는 쪽으로 보고됩니다. 이 점은 affinity maturation이 developability를 망가뜨릴 수 있다는 일반적인 우려를 생각하면 검토할 만한 부분입니다.

이 결과는 full-length mAb design에서 의미가 있습니다. Antibody hit은 target binding만으로 충분하지 않고, developability 문제가 바로 다음 병목이 되기 때문입니다. 다만 이 panel은 early in vitro developability evidence입니다. Formulation stability, PK/PD, immunogenicity, toxicity, CMC readiness까지 보여주는 것은 아닙니다.

Specificity는 intended target binding, unintended designs/antigens, TL1A/PRLR off-target controls를 통해 평가됩니다. 이것은 useful screening layer지만, proteome-wide specificity나 in vivo safety와 같지는 않습니다.

Structural validation과 IL36RA function

Origin-1의 중요한 evidence layer는 structural validation입니다. Cryo-EM은 COL6A3, AZGP1, IL36RA 세 complexes에 대해 수행되었습니다. Reported resolution은 각각 3.0 Å, 3.3 Å, 3.1 Å이고, designed structure와 experimental structure의 DockQ는 COL6A3 0.83, AZGP1 0.91, IL36RA 0.87입니다. Interface RMSD는 0.96 Å, 0.49 Å, 0.78 Å입니다.

이 결과는 단순 binding hit보다 evidence layer가 두껍습니다. De novo antibody design에서는 target에 붙는 것뿐 아니라 설계한 epitope와 pose가 실제로 맞는지가 볼 지점입니다. Origin-1은 적어도 세 selected complexes에서 designed pose가 experimental structure와 잘 맞는다는 evidence를 보여줍니다.

Functional validation은 IL36RA에 한정됩니다. Parent IL36RA design은 HEKBlue assay에서 functional하지 않았지만, first-round affinity-matured variants는 antagonism을 보였고 최고 EC50 104 nM이었습니다. Second-round variants에서는 최고 EC50 12.3 nM을 보고했고, mouse CT-26 + CCL2 assay에서도 functional antagonism을 확인했습니다.

따라서 Origin-1의 functional claim은 IL36RA-specific입니다. COL6A3, AZGP1, CHI3L2까지 broad functional validation이 된 것은 아닙니다. Binding, pose, developability, function을 같은 층위로 합치지 않는 것이 중요합니다.

실패 target을 같이 읽기

Origin-1에서 실패 target은 부록이 아니라 main story의 일부입니다. FOLR1, FOLR2, CLEC4A, AMBP, ALCAM, KLK1 여섯 target에서는 hit을 찾지 못했습니다. 이 점은 Origin-1의 claim을 낮추는 동시에, zero-prior full-length mAb design이 여전히 어려운 문제라는 사실을 보여줍니다.

이 실패들은 publication-facing 해석에서 같이 다뤄야 합니다. 성공한 4개 target만 보면 Origin-1이 robust platform처럼 보일 수 있지만, 전체 10개 target을 보면 partial target success입니다. Parent hit도 weak한 경우가 많고, 가장 높은 affinity/function outcome은 optimization 이후에 나옵니다.

이 구조는 antibody design benchmark를 읽을 때 좋은 기준이 됩니다. Target-level success, candidate-level hit rate, parent affinity, optimized affinity, function, pose validation, developability를 각각 따로 봐야 합니다. Origin-1은 그 구분을 연습하기 좋은 case입니다.

평가: full-length mAb design benchmark로서의 Origin-1

내가 보기에 Origin-1은 “antibody design을 해결했다”는 자료가 아닙니다. 더 정확히는 full-length mAb design에서 zero-prior epitope를 겨냥하고, 일부 target에서 weak parent hit을 찾은 뒤, optimization과 multi-layer validation으로 candidate를 끌어올린 사례입니다.

이 점에서 Origin-1은 Chai-2나 Protenix-v2와도 다릅니다. Chai-2와 Protenix-v2는 low-N antibody/VHH-Fc hit discovery의 performance milestone에 가깝습니다. Origin-1은 full-length mAb, zero-prior epitope, lead optimization, cryo-EM pose validation, IL36RA function이 결합된 더 좁고 깊은 story입니다.

그래서 이 preprint의 안전한 해석은 다음에 가깝습니다. Origin-1은 full-length antibody design이 여전히 어렵다는 것을 숨기지 않으면서, generation, paired CDR sequence design, co-folding score, lead optimization, and validation stack을 결합하면 일부 zero-prior-like target에서 실험적으로 설득력 있는 결과를 만들 수 있음을 보여줍니다. 성공과 실패를 같이 놓을 때 이 자료의 의미가 가장 명확해집니다.

참고

•

Levine et al., “Origin-1: a generative AI platform for de novo antibody design against novel epitopes”, bioRxiv preprint, 2026. https://www.biorxiv.org/content/10.64898/2026.01.14.699389

•

Absci / AbSciBio Origin-1 data repository: https://github.com/AbSciBio/origin-1