Nature Biotechnology
2009年2月封面文章:
基于安捷倫寡核苷酸序列合成技術的液相序列捕獲系統(tǒng) – 高通量平行靶向測序的最佳解決方案
來自美國麻省理工學院和哈佛大學Broad研究院的研究人員,利用安捷倫(Agilent Technologies)卓越的寡核苷酸序列合成技術,合成大量生物素標記的RNA“誘餌”用于靶向序列的捕獲,成功應用于高通量平行靶向測序。這一重大科學進展作為封面文章發(fā)表在最新一期的«自然•生物科技»雜志上。
下一代測序技術(Next Generation Sequencing)是對傳統(tǒng)測序技術的革命性變革,可以一次完成數(shù)十萬到數(shù)百萬條DNA分子的序列測定,使得在極短時間內對人類轉錄組和基因組進行細致研究成為可能。2003年,“人類基因組計劃”獲得首張人類全基因組圖譜,為了測定人類基因組的30億對堿基,數(shù)以千計的科學家們花費了超過30億美元和13年的時間。盡管與傳統(tǒng)測序技術相比,下一代測序技術的成本大幅降低,但目前測序一套個人基因組圖譜的成本仍然高達數(shù)十萬美元。所以,在現(xiàn)階段,如何實現(xiàn)對大批量樣本的高通量平行靶向測序,對許多科學和醫(yī)學研究課題具有重大意義。
為了實現(xiàn)這一目標,研究人員從生物芯片技術得到靈感,隨機選擇了1,900個人類基因,針對這些基因的15,565個編碼蛋白的外顯子序列(﹥2.5 Mb)以及另外4個基因組大片段序列(﹥1.7 Mb),根據(jù)tiling方法設計并且利用安捷倫的“SurePrint”技術合成了22,000條200-mers寡核苷酸探針。再經(jīng)PCR和反轉錄后得到足夠量的生物素標記的RNA“誘餌”,巧妙建立RNA-DNA的液相雜交系統(tǒng),對全基因組DNA中的目的序列進行成功捕獲,序列數(shù)據(jù)經(jīng)由Illumina公司的測序儀讀取。
測序結果表明:
1)極佳的特異性:高達90%的讀出序列來源于RNA“誘餌”捕獲,其中和靶向序列完全重合的超過50%;
2)優(yōu)秀的均一性:對基因組大片段序列,至少和捕獲序列重合一半以上的RNA“誘餌”超過80%,即使對小片段的外顯子序列,這一數(shù)字也超過60%;
3)卓越的重現(xiàn)性:2組技術重復試驗間的差異率不到10-5;
4)可以精確地檢測SNP。綜上所述,實驗結果完美驗證了基于液相序列捕獲的高通量平行靶向測序技術的特異性,準確性,重現(xiàn)性和廣泛的應用前景。
原文摘要:
Nature Biotechnology. 2009 Feb;27(2):182-9. Epub 2009 Feb 1.
Solution hybrid selection with ultra-long oligonucleotides for massively parallel targeted sequencing
Andreas Gnirke1, Alexandre Melnikov1, Jared Maguire1, Peter Rogov1, Emily M LeProust2, William Brockman1,5, Timothy Fennell1, Georgia Giannoukos1, Sheila Fisher1, Carsten Russ1, Stacey Gabriel1, David B Jaffe1, Eric S Lander1,3,4 & Chad Nusbaum1
Targeting genomic loci by massively parallel sequencing requires new methods to enrich templates to be sequenced. We developed a capture method that uses biotinylated RNA ‘baits’ to fish targets out of a ‘pond’ of DNA fragments. The RNA is transcribed from PCR-amplified oligodeoxynucleotides originally synthesized on a microarray, generating sufficient bait for multiple captures at concentrations high enough to drive the hybridization. We tested this method with 170-mer baits that target 415,000 coding exons (2.5 Mb) and four regions (1.7 Mb total) using Illumina sequencing as read-out. About 90% of uniquely aligning bases fell on or near bait sequence; up to 50% lay on exons proper. The uniformity was such that B60% of target bases in the exonic ‘catch’, and B80% in the regional catch, had at least half the mean coverage. One lane of Illumina sequence was sufficient to call high-confidence genotypes for 89% of the targeted exon space.
1Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, Massachusetts 02142, USA. 2Agilent Technologies Inc., 5301 Stevens Creek Blvd., Santa Clara, California 95051, USA. 3Department of Biology, Massachusetts Institute of Technology, 77 Massachusetts Ave., Cambridge, Massachusetts 02139, USA. 4Department of Systems Biology, Harvard Medical School, 200 Longwood Ave., Boston, Massachusetts 02115, USA. 5Present address: Google, Inc., 5 Cambridge Center, Cambridge, Massachusetts 02142, USA. Correspondence should be addressed to A.G. (gnirke@broad.mit.edu).