Table Of Content

Stivalaetal.BMCBioinformatics2010,11:446 http://www.biomedcentral.com/1471-2105/11/446 METHODOLOGY ARTICLE Open Access Fast and accurate protein substructure searching with simulated annealing and GPUs Alex D Stivala1*, Peter J Stuckey1,2, Anthony I Wirth1 Abstract Background: Searching a database of protein structures for matches to a query structure, or occurrences of a structural motif, is an important task in structural biology and bioinformatics. While there are many existing methods for structural similarity searching, faster and more accurate approaches are still required, and few current methods are capable of substructure (motif) searching. Results: We developed an improved heuristic for tableau-based protein structure and substructure searching using simulated annealing, that is as fast or faster and comparable in accuracy, with some widely used existing methods. Furthermore, we created a parallel implementation on a modern graphics processing unit (GPU). Conclusions: The GPU implementation achieves up to 34 times speedup over the CPU implementation of tableau- based structure search with simulated annealing, making it one of the fastest available methods. To the best of our knowledge, this is the first application of a GPU to the protein structural search problem. Background number of residues is naturally much larger than the Searching a database of protein structures for structures number of SSEs, these methods must solve problems of that are similar to, or contain substructures that are a larger size than SSE-based methods. similar to, a query structure is a significant problem in SHEBA [16] and YAKUSA both use a one-dimen- structural biology and bioinformatics. We can classify sional representation of protein structure to accelerate methods for protein structural searches into four cate- structural searching. SHEBA uses “environmental pro- gories. First, methods that align proteins directly at the files” containing information about sequence homology level of residues. Dali and DaliLite [1,2] fall into this and residue-dependent information such as solvent category. Second, methods that align proteins at the accessibility, hydrogen bonds, and side-chain packing, level of secondary structure elements (SSEs). Tableau- which is then refined for three-dimensional geometry by Search [3], ProSMoS [4], and the TOPS-based methods dynamic programming. YAKUSA is a fast method that [5,6] fall into this category. Third, methods that perform uses a one-dimensional representation based on protein an initial alignment at the level of SSEs, and then extend internal angles. it to a residue level alignment. VAST [7,8], SSM [9], We note that the classification just described is not LOCK2 [10], and SARF2 [11] fall into this category. strict or even exclusive. For example, we place SHEBA Fourth, methods that do not perform an alignment at in the first category since it provides a residue-level all, but use some other means of providing a similarity alignment, but it does not do so directly (using its score. YAKUSA [12] and PRIDE [13-15] fall into this “environmental profiles” as a first step), so it could be category. Methods in the first category tend to be the considered as partly belonging to the fourth category, slowest, since they are not necessarily designed solely or with an extra stage bringing it into the first category. primarily for database scanning, but also to provide a However SHEBA certainly does not belong to the sec- set of correspondences between residues. Since the ond or third categories since it does not perform matching at the SSE level. Note also that any method in the second category (SSE alignment) can be transformed *Correspondence:[email protected] 1DepartmentofComputerScienceandSoftwareEngineering,TheUniversity into a method in the third category by adding to it ofMelbourne,Victoria3010,Australia Fulllistofauthorinformationisavailableattheendofthearticle ©2010Stivalaetal;licenseeBioMedCentralLtd.ThisisanOpenAccessarticledistributedunderthetermsoftheCreativeCommons AttributionLicense(http://creativecommons.org/licenses/by/2.0),whichpermitsunrestricteduse,distribution,andreproductionin anymedium,providedtheoriginalworkisproperlycited. Stivalaetal.BMCBioinformatics2010,11:446 Page2of17 http://www.biomedcentral.com/1471-2105/11/446 some method of extending the SSE alignment to a resi- residues that are also in contact in the other [26]. Since due alignment. MAX-CMO operates at the level of residues, it belongs Some recent methods use information about SSE to the first of the four categories we have described. orientation to match proteins by global geometric infor- MAX-CMO has been solved exactly by various meth- mation. TableauSearch and IR Tableau [17] make use of ods including Lagrangian relaxation [27,28] and branch- tableaux [18] to perform rapid and accurate comparison and-bound [29], but these methods are often impracti- of whole protein structures. QP Tableau Search [19], cally slow and so heuristics have been used to approxi- another tableau-based method, is closer to the original mate the solution [30]. However, compared to the quadratic integer programming (QIP) formulation MSVNS4MaxCMO heuristic [30], it has been shown defined by Konagurthu et al. [3] and (unlike the other that a tableau-based method is faster and has equal or methods mentioned previously) allows substructure greater accuracy in ranking protein similarity [19]. (motif) searching and non-sequential matchings. The In this paper, we demonstrate SA Tableau Search, a latter refers to sets of correspondences between SSEs in new heuristic for tableau-based protein structural and which the sequential order of corresponding SSEs is not substructure searching based on simulated annealing preserved. It is, however, considerably slower than most [31], and a parallel implementation of it on a modern other (non-alignment) methods. ProSMoS [4] also general-purpose graphics processing unit (GPGPU). We makes use of SSE orientations, but takes quite a differ- compare its accuracy as a fold classification method ent approach from most other methods. It does not with the existing methods DaliLite, SHEBA, VAST, compare structures against each other, but rather a YAKUSA, SARF2, LOCK2, TOPS, TableauSearch, QP query motif is defined by the user as a “meta-matrix”, Tableau Search, and IR Tableau on three different data which is then used to find structures that contain a sub- sets: a set of 200 queries in the ASTRAL SCOP 1.75 structural motif which matches the query motif thus 95% sequence identity non-redundant database [32,33], defined. all-against-all queries in the Fischer data set [34], and Methods that use SSEs require some method for the COPS benchmark [35]. Because SA Tableau Search assigning secondary structure to proteins. That is, some has a parameter, the number of restarts of its simulated method to classify residues in the protein as belonging annealing schedule, that can be adjusted as a tradeoff to helices (of various kinds) or strands (as part of a b- between speed and accuracy, we perform these compari- sheet), or not part of an SSE. This can be done by a sons with a number of different values of this parameter. variety of methods, such as pattern recognition of Three of the methods we compare SA Tableau Search hydrogen bonds and geometrical features, as in probably with are also tableau-based methods: TableauSearch, QP the best-known method, DSSP [20], or by hydrogen Tableau Search, and IR Tableau. Although the tableau- bonds and statistically derived backbone torsional angle based methods generally belong to the second category information as in STRIDE [21]. Assignment of second- described above, since tableaux are defined in terms of ary structure is not an exact procedure, with various SSEs, IR Tableau is an exception, belonging instead to methods disagreeing about the exact beginning and end the fourth class. By reducing the tableau representation of SSEs, and particular SSEs depart significantly from to feature vectors which consist of counts of the number the “ideal” model of Pauling and Corey [22-24]. In addi- of different SSE orientation relationships in a structure’s tion, using only SSEs means that regions of protein tableau, IR Tableau can compare structures by cosine structure not defined as being part of an SSE are not similarity, that is, just the cosine of the angle between used at all, which can lead to less sensitive results. In the feature vectors. This is extremely fast, but results in some cases, this excludes a structure from being pro- no alignment of SSEs, only a similarity score. Tableau- cessed at all, if the method of assignment determines Search is described in the original tableau-based protein that it contains no SSEs. At least a partial solution to structure searching paper as an approximation to the this can be to use a method of defining SSEs that maximally-similar subtableaux extraction problem, assigns a larger fraction of the structure to SSEs, as is which is an NP-hard problem [3]. It uses an alignment- done by ProSMoS, which uses the PALSSE method [25], like approach with two phases of dynamic programming, specifically designed for use in protein structure similar- and is fast (but not as fast as IR Tableau), but is inher- ity searching. ently sequential (it cannot find non-sequential structural Another protein structure comparison method is max- matchings), and cannot find substructure matchings imum contact map overlap (MAX-CMO), which consists (motifs). Unlike IR Tableau, however, TableauSearch is of finding an alignment of residues in two proteins that capable of providing a set of correspondences between maximizes the overlap between their contact maps. That SSEs. QP Tableau Search is another method of approxi- is, it maximizes the number of contacts where residues mating a solution to this problem, by relaxing the origi- that are in contact in one protein are aligned with nal QIP to a quadratic program (QP) [19]. This means Stivalaetal.BMCBioinformatics2010,11:446 Page3of17 http://www.biomedcentral.com/1471-2105/11/446 that some of the desirable properties of the original (GPUs) have recently been used for several bioinfor- (exact) formulation are retained, specifically that non- matics applications, such as sequence alignment [36-39], sequential and substructure matches can be found. In molecular dynamics [40,41], microarray data analysis addition, QP Tableau Search introduces an additional [42], mass spectrometry data analysis [43], and phyloge- constraint, that the difference in the distances between netics [44]. However, despite the large variety of protein matched SSEs cannot exceed a threshold, which is structural search methods and their computationally found to increase accuracy when used as a protein intensive nature, to the best of our knowledge this is the structural database scanning method assessed on fold first use of GPUs to accelerate protein structural or sub- classification. structural searching. Of the methods described so far, only ProSMoS, TOPS, QP Tableau Search and SA Tableau Search are Results and Discussion capable of substructure (motif) queries, and in fact Protein structure search (fold classification) ProSMoS is designed specifically to search for manually Figure 1 shows the AUC (area under the ROC curve – defined motifs in a database of structures, rather than see Methods section) and Table 1 shows AUC and structure similarity searching. In both these cases the elapsed time for several methods run on a set of 200 motifs are defined at the SSE (not residue) level. None queries in the ASTRAL SCOP 1.75 95% sequence iden- of the residue-based methods (such as DaliLite and tity non-redundant subset. Run with 128 restarts, SA SHEBA) are designed to allow a motif (substructure) Tableau Search is one of the faster methods even on the query, as they assess structural similarity to find com- host CPU, taking approximately the same time as mon regions between two structures and find an opti- YAKUSA: only TableauSearch, TOPS, and (especially) mal superposition. As discussed in the description of IR Tableau are faster. Run with 4096 restarts, SA ProSMoS [4], motifs, in contrast, are defined by “the Tableau Search is one of the most accurate methods, main-chain topology, and general orientation and pack- with AUC not statistically significantly different from ing of secondary structural elements” [[4], p. 1331]. that of SHEBA (Table 2), but considerably slower. Note TOPS allows specification of such motifs, but operates that the results for SARF2 are not included for this data on topological similarity only, while the QP Tableau set as it is unable to process a data set this large, and Search and SA Tableau Search algorithms use tableaux, DaliLite results are omitted, as it exceeds a CPU time which are based on SSE orientation, and distance limit of 400 hours. matrices. ProSMoS makes use of “meta-matrices” which Considering now the GPU implementation of SA include information on SSE orientation and contacts Tableau Search, we can see that on average the GTX including hydrogen-bonding. Graphics processing units 285 card provides a 33 times speedup (34 times for Figure1AUCfordifferentmethodsfor200queriesintheASTRAL95%dataset.AreaundertheROCcurve(AUC)fordifferentmethods onthe200querysetagainsttheASTRALSCOP95%sequenceidentitynon-redundantdatabase,excludingcomparisonsofastructureagainst itself.EachAUCvalueisshownwitha95%confidenceinterval.SAreferstoSATableauSearch(followedbynumberofrestarts)andQPrefersto QPTableauSearch.SARF2isnotpresentinthisgraph,asitisunabletoprocessadatasetthislarge,andDaliLiteisnotpresentasitexceeds theCPUtimelimit. Stivalaetal.BMCBioinformatics2010,11:446 Page4of17 http://www.biomedcentral.com/1471-2105/11/446 Table 1AUC andelapsed timefor 200queries in theASTRAL 95% data set 95%confidenceinterval Method Platform Restarts Elapsedtime AUC Standarderror lower upper SHEBA CPU - 25h22m 0.931 0.004 0.924 0.938 SATableauSearch CPU 4096 142h42m 0.930 0.004 0.923 0.937 SATableauSearch GTX285 4096 4h11m 0.930 0.004 0.923 0.937 SATableauSearch TeslaC1060 4096 5h40m 0.930 0.004 0.923 0.937 SATableauSearch CPU 128 4h18m 0.921 0.004 0.913 0.929 SATableauSearch GTX285 128 0h08m 0.920 0.004 0.912 0.927 SATableauSearch TeslaC1060 128 0h11m 0.920 0.004 0.912 0.927 QPTableauSearch CPU - 157h51m 0.914 0.004 0.906 0.922 LOCK2 CPU - 208h09m 0.912 0.004 0.904 0.920 TableauSearch CPU - 1h11m 0.858 0.005 0.848 0.867 TOPS CPU - 1h11m 0.855 0.005 0.845 0.864 VAST CPU - 14h26m 0.855 0.005 0.846 0.865 IRTableau CPU - 0h01m 0.854 0.005 0.844 0.864 YAKUSA CPU - 4h14m 0.827 0.005 0.816 0.837 AreaundertheROCcurve(AUC)andelapsedtimefordifferentmethodsonthe200querysetagainsttheASTRALSCOP95%sequenceidentitynon-redundant database.ThetableissortedbyAUCdescending. InthePlatformcolumn,eitheraGPUcardisdescribed,orCPU,whichisasinglecoreofanAMDQuadCoreOpteron(2.3GHz,32GBRAM)runningLinux. SARF2isnotpresentinthistable,asitisunabletoprocessadatasetthislarge,andDaliLiteisnotpresentasitexceedstheCPUtimelimit. 4096 restarts), and the Tesla C1060 card provides on significant difference between the value for 4096 and for average a 24 times speedup. SA Tableau Search run 8192 restarts (Table 2). Hence, although changing the with 4096 restarts on the GTX 285 takes 4 hours and number of restarts allows a user-selectable tradeoff 11 minutes, making it not only one of the most accurate between accuracy and speed, one runs into rapidly methods considered, but also one of the fastest. Figure 2 diminishing returns for numbers of restarts beyond a shows that, while elapsed time obviously increases line- certain point; on this evaluation data set, using more arly with the number of restarts (above 128, the number than 4096 restarts does not increase the AUC, and 512 run in parallel), the accuracy as measured by the AUC restarts (taking therefore approximately 1/8 the time of figure increases much more slowly, and there is no 4096 restarts) is enough to achieve the same accuracy (within statistical significance at p-value 0.05) as SHEBA but in just 31 minutes 25 seconds on the GTX 285. On Table 2ΔAUCrelative to SA Tableau Searchfor 200 the host CPU, this takes 17 hours 53 minutes, and queries inthe ASTRAL 95% dataset SHEBA takes over 25 hours. Method(s) ΔAUC Figure 3, Table 3, and Table 4 show the AUC and SATableauSearch4096,SHEBA,SATableauSearch8192 0.000 elapsedtimeforall-against-allqueriesintheFischerdata SATableauSearch2048 0.0007 set. This data set is much smaller, but was constructed SATableauSearch1024 0.0012 specificallytobenchmarkfoldrecognitionandsocontains SATableauSeach512 0.0023 structurallysimilarproteinswithverylowsequencesimi- SATableauSearch256 0.0039 larity [34]. In this data set, QP Tableau Search, SHEBA, SATableauSearch128 0.0090 DaliLite and LOCK2 all have statistically significantly QPTableauSearch 0.0160 higherAUCvaluesthanSATableauSearch,althoughthe LOCK2 0.0183 latterwhenrunonaGPUcardisconsiderablyfasterthan TableauSearch 0.0723 thesemethods.WefindthatIRTableauworksparticularly VAST 0.0746 wellonthisdataset,withanAUCnotstatisticallysignifi- TOPS 0.0756 cantlydifferentfromthatofSATableauSearch,andfaster IRTableau 0.0761 (on the host CPU) than even SA Tableau Search on the YAKUSA 0.1034 GTX285card(thefastestavailabletous). DifferenceinAUCrelativetoSATableauSearch(4096restarts)fordifferent Figure 4, Table 5, and Table 6 show the AUC and methodsonthe200querysetagainsttheASTRALSCOP95%sequence elapsed time for the queries in the COPS benchmark identitynon-redundantdatabase.ThetableissortedbyΔAUC,sothat methodswithlowerAUCthanSATableauSearch(ΔAUC>0,withp-value< [35]. DaliLite, SHEBA, YAKUSA, and SARF2 have the 0.05)areatthebottomofthetable.Methodsforwhichthereisno bestAUCvaluesforthisdataset,followedbySATableau statisticallysignificantdifferenceinAUCfromSATableauSearch(4096 SearchwhichhasanAUCvaluenotsignificantlydifferent restarts)atp-value0.05areshowninasinglerowwithΔAUC=0.000. Stivalaetal.BMCBioinformatics2010,11:446 Page5of17 http://www.biomedcentral.com/1471-2105/11/446 Figure2AUCversuselapsedtimeforSATableauSearch.AUCversuselapsedtimeforSATableauSearchondifferentplatformsforthe200 querysetagainsttheASTRALSCOP95%sequenceidentitynon-redundantdatabase.EachdatapointisanAUCvalue,andislabelledwiththe numberofrestartsthatSATableauSearchisrunwith. from that of VAST and LOCK2. Although it is the best Part of this benchmark consists of an assignment of pro- performingonAUC,DaliLiteisbyaconsiderablemargin tein sequences to their most compatible fold, on purely the slowest program. In this benchmark, TOPS and the structural criteria, independent of the protein represen- other tableau based methods have significantly lower tation and similarity scores used by the fold recognition AUCthantheothermethods. methods to be assessed [34]. This benchmark has since The Fischer and COPS data sets have quite different been used, as we do here, for the somewhat different properties. The Fischer data set was designed to bench- purpose of benchmarking methods that compare two mark fold recognition methods, independent of the known protein structures, for example in Pelta et al. structural similarity score being tested. That is, it was [30]. No two proteins in the data set have sequence originally constructed to benchmark methods for assign- similarity over 35% [34], and it is quite small, consisting ing a sequence of unknown structure to a known fold. of only 68 structures. We perform all-against-all queries Figure 3 AUC for different methods in the Fischer data set. Area under the ROC curve (AUC) for different methods for all-against-all comparisonsintheFischerdataset,excludingcomparisonsofastructureagainstitself.EachAUCvalueisshownwitha95%confidence interval.SAreferstoSATableauSearch(followedbynumberofrestarts)andQPreferstoQPTableauSearch. Stivalaetal.BMCBioinformatics2010,11:446 Page6of17 http://www.biomedcentral.com/1471-2105/11/446 in this data set, resulting in 4624 pairwise comparisons. Table 4ΔAUCrelative to SA Tableau Searchin the The COPS benchmark, in contrast, was designed to Fischer data set benchmark sequence similarity database searches, and Method(s) ΔAUC true positives in this benchmark are defined by the QPTableauSearch -0.0533 COPS classification [35,45]. This classification is defined SHEBA -0.0409 by structural similarity according to the TopMatch DaliLite -0.0399 structural alignment algorithm [46,47], and so the COPS LOCK2 -0.0379 benchmark in fact assesses a method according to its SATableauSearch4096,IRTableau 0.000 agreement with TopMatch. SATableauSearch1024 0.0055 We note that in these last two benchmark data sets SATableauSearch128 0.0143 (Fischer and COPS), in both cases, four methods have a SARF2 0.0398 statistically significantly higher AUC than SA Tableau YAKUSA 0.0954 Search. However, it is a different set of four methods in TOPS 0.1449 the two cases, with SHEBA and DaliLite in common. TableauSearch 0.1744 Hence, given the different purposes of the Fischer and VAST 0.2365 COPS benchmarks, it would seem that SHEBA and DifferenceinAUCrelativetoSATableauSearch(4096restarts)methodsfor DaliLite are superior for both “difficult” (distantly all-against-allcomparisonsintheFischerdataset.Thetableissortedby related and low sequence similarity) and “easy” (more ΔAUC,sothatmethodswithhigherAUCthanSATableauSearch(ΔAUC<0, withp-value<0.05)areatthetopofthetable,andthosewithlowerAUC closely related) database searching tasks. YAKUSA and thanSAtableausearch(ΔAUC>0,withp-value<0.05)areatthebottomof SARF2 perform better only on the more closely related thetable.Methodsforwhichthereisnostatisticallysignificantdifferencein AUCfromSATableauSearch(4096restarts)atp-value0.05areshownina searches (or, to be precise, agree more closely with Top- singlerowwithΔAUC=0.000. Match), and LOCK2 and QP Tableau Search perform better on the more distantly related searches. methods, with no statistically significant difference in In summary, over the three benchmark data sets we AUC (at p-value 0.05), in the ASTRAL 95% 200 query tested, no one method is consistently the best perform- benchmark. This data set is the largest one, but it also ing, although SHEBA is consistently in the top two contains many similar structures and sequences. In all methods measured by AUC, and it is considerably faster three benchmark data sets, SA Tableau Search has a than DaliLite, the only method that has a higher accu- (statistically significantly) higher AUC than TOPS and racy on one of the benchmarks. All the other methods TableauSearch, and in all but COPS (where it has a not tested appear at different ranks in different benchmarks. significantly different AUC), it has a higher AUC than SHEBA and SA Tableau Search are the top two ranking VAST. Table 3AUC andelapsed timein the Fischer data set 95%confidenceinterval Method Platform Restarts Elapsedtime AUC standarderror lower upper SHEBA CPU - 03m47s 0.877 0.016 0.845 0.909 DaliLite CPU - 113m34s 0.876 0.016 0.845 0.908 LOCK2 CPU - 32m12s 0.874 0.016 0.842 0.907 IRTableau CPU - <1s 0.859 0.015 0.830 0.888 QPTableauSearch CPU - 52m10s 0.848 0.018 0.813 0.882 SATableauSearch CPU 4096 12m07s 0.837 0.018 0.801 0.872 SATableauSearch TeslaC1060 4096 00m55s 0.836 0.018 0.800 0.871 SATableauSearch GTX285 4096 00m40s 0.829 0.018 0.792 0.865 SATableauSearch CPU 128 00m25s 0.822 0.019 0.786 0.859 SATableauSearch GTX285 128 00m02s 0.809 0.019 0.771 0.846 SATableauSearch TeslaC1060 128 00m02s 0.804 0.019 0.767 0.842 SARF2 CPU - 19m34s 0.797 0.020 0.759 0.835 YAKUSA CPU - 00m02s 0.741 0.021 0.700 0.782 TOPS CPU - 01m05s 0.692 0.022 0.649 0.734 TableauSearch CPU - <1s 0.662 0.022 0.619 0.705 VAST CPU - 02m33s 0.600 0.022 0.557 0.643 AreaundertheROCcurve(AUC)andelapsedtimefordifferentmethodsforall-against-allcomparisonsintheFischerdataset.ThetableissortedbyAUC descending.InthePlatformcolumn,eitheraGPUcardisdescribed,orCPU,whichisasinglecoreofanAMDQuadCoreOpteron(2.3GHz,32GBRAM)running Linux. Stivalaetal.BMCBioinformatics2010,11:446 Page7of17 http://www.biomedcentral.com/1471-2105/11/446 Figure4AUCfordifferentmethodsintheCOPSbenchmarkdataset.AreaundertheROCcurve(AUC)fordifferentmethodsfortheCOPS benchmarkdataset.EachAUCvalueisshownwitha95%confidenceinterval.SAreferstoSATableauSearch(followedbynumberofrestarts) andQPreferstoQPTableauSearch. It is worth noting that QP Tableau Search and SA host CPU than QP Tableau Search, but its simplicity Tableau Search use exactly the same formulation of the allows parallelization on GPU cards, which are currently protein substructure search problem as the extraction of quite restricted in some respects, such as availability of maximally-similar subtableaux first described by Kona- sophisticated math libraries. For example, the complex- gurthu et al. [3], enhanced by distance difference con- ity of QP Tableau Search in its use of an interior point straints introduced in Stivala et al. [19]. The difference solver makes it impossible for us to implement a GPU is in the method of approximation: QP Tableau Search version. As will be described in detail in the Methods relaxes the problem to a QP to find a (locally) optimal section, there are two levels of parallelization in our solution, while SA Tableau Search, described here, uses implementation: first, each restart of the simulated simulated annealing. Not only is the latter method faster annealing schedule for a single comparison of two struc- and often more accurate (with sufficient restarts) on the tures is run in parallel, and second, multiple Table 5AUC andelapsed timein COPSbenchmark data set 95%confidenceinterval Method Platform Restarts Elapsedtime AUC standarderror lower upper DaliLite CPU - 123h29m47s 0.992 0.002 0.989 0.996 SHEBA CPU - 6h04m16s 0.978 0.003 0.971 0.984 YAKUSA CPU - 0h02m25s 0.975 0.003 0.969 0.982 SARF2 CPU - 13h57m34s 0.972 0.004 0.964 0.978 SATableauSearch CPU 4096 8h41m18s 0.957 0.004 0.949 0.966 SATableauSearch GTX285 4096 0h26m59s 0.957 0.004 0.949 0.966 SATableauSearch TeslaC1060 4096 0h30m30s 0.957 0.004 0.948 0.965 LOCK2 CPU - 72h11m06s 0.954 0.005 0.945 0.962 SATableauSearch CPU 128 0h15m17s 0.951 0.005 0.942 0.960 VAST CPU - 1h37m53s 0.950 0.005 0.941 0.959 SATableauSearch TeslaC1060 128 0h03m00s 0.948 0.005 0.939 0.958 SATableauSearch GTX285 128 0h03m02s 0.947 0.005 0.938 0.957 QPTableauSearch CPU - 74h47m43s 0.925 0.006 0.914 0.936 TOPS CPU - 0h18m25s 0.881 0.007 0.868 0.894 IRTableau CPU - <1s 0.845 0.008 0.831 0.860 TableauSearch CPU - 0h18m07s 0.840 0.008 0.825 0.855 AreaundertheROCcurve(AUC)andelapsedtimefordifferentmethodsontheCOPSbenchmarkdataset.ThetableissortedbyAUCdescending.Inthe Platformcolumn,eitheraGPUcardisdescribed,orCPU,whichisasinglecoreofanAMDQuadCoreOpteron(2.3GHz,32GBRAM)runningLinux. Stivalaetal.BMCBioinformatics2010,11:446 Page8of17 http://www.biomedcentral.com/1471-2105/11/446 Table 6ΔAUCrelative to SA Tableau Searchin the COPS level is to parallelize each comparison, this would benchmark data set require a parallelization of both the dynamic program- Method(s) ΔAUC ming procedure and the three-dimensional translation DaliLite -0.0349 and rotation procedure used iteratively by SHEBA [16]. SHEBA -0.0201 This is a much more challenging task than simply run- YAKUSA -0.0180 ning the multiple restarts of SA Tableau Search in SARF2 -0.0140 parallel. SATableauSearch4096,VAST,LOCK2 0.000 SATableauSearch128 0.0064 Substructure queries QPTableauSearch 0.0325 Evaluating the accuracy of substructure (motif) queries TOPS 0.0764 in a quantitative and objective way such as AUC is quite IRTableau 0.1120 challenging; there is no database such as SCOP to pro- TableauSearch 0.1172 vide a set of all true occurrences of a motif in general. We therefore provide two examples where we can pro- DifferenceinAUCrelativetoSATableauSearch(4096restarts)methodson theCOPSbenchmarkdataset.ThetableissortedbyΔAUC,sothatmethods vide such an evaluation: the b-grasp motif from ubiqui- withhigherAUCthanSATableauSearch(ΔAUC<0,withp-value<0.05)are tin [49], and the serpin (serine protease inhibitor) B/C atthetopofthetable,andthosewithlowerAUCthanSAtableausearch (ΔAUC>0,withp-value<0.05)areatthebottomofthetable.Methodsfor sheet substructure [50]. We perform these queries in whichthereisnostatisticallysignificantdifferenceinAUCfromSATableau the ASTRAL SCOP 1.75 95% sequence identity non- Search(4096restarts)atp-value0.05areshowninasinglerowwithΔAUC= redundant database. 0.000. In evaluating the accuracy of a substructure query for comparisons between a single query and multiple data- theb-graspmotif,weusethedatafromTable1ofShiet base structures are run in parallel. Any database scan- al. [4] as the gold standard. A hit is considered a true ning method can be trivially parallelized in the second positive if it is in the same SCOP superfamily as the way, since each pairwise comparison is independent; to exemplars listed in Table 1 of Shi et al. [4] for the b- implement this at all on a GPU requires that the grasp core and gregarious fold [51] categories, or if it is method is capable of being implemented within the lim- oneofthe structuresconsideredby Shiet al.[4]to con- ited computational and fast memory resources of the tain the b-grasp motif by structural drift [52]. We GPU. Since there is no recursion and no dynamic mem- demonstratetwoqueriesforthistest.Firstweuseubiqui- ory allocation in the NVIDIA CUDA programming tin (SCOP identifier d1ubia_), an exemplar of the b- model [48], this can make implementing sophisticated grasp fold, as the query. Second, we use a subset of the algorithms such as those required for pattern searching SSEs, namely the four largest strands and the a-helix in in YAKUSA [12], for example, extremely difficult, if not d1ubia_, chosen to represent the essential part of the impossible. Even where this is possible, since the GPU is b-graspmotif. Thismotifquery isbasedon thatdefined essentially a data-parallel architecture [48], efficient by Shi et al. [4] as a “meta-matrix” for ProSMoS. These implementations require that essentially the same code querystructuresareillustratedinFigure5. path is run simultaneously in a large number of threads, The serpin B/C sheet substructure (see Figure 5) is just with different data. The more threads diverge in such a large and specific structure that we can be confi- their code path, the less efficient the parallelization will dent it is truly present only in instances of the serpin be, another reason why very simple algorithms are more fold. Therefore, in evaluating the accuracy of a substruc- suited to such parallelization. An efficient parallel imple- ture query for this substructure, we consider a hit a true mentation on a GPU requires in addition a “fine- positive if it is a member of the serpin fold in SCOP, grained” level of parallelization, in order to maximize and a false positive otherwise. We choose the B/C sheet the usage ("occupancy”) of the thread multiprocessors of the canonical active serpin, a -antitrypsin, PDB id 1 [48] and the small amount of fast shared memory that 1QLP[53] as the query structure. this finer level of parallelization has access to. SA Of the protein structure comparison methods bench- Tableau Search is ideally suited to this architecture marked in this paper, only QP Tableau Search, SA since its data (tableaux and distance matrices) are Tableau Search, TOPS and ProSMoS are also substruc- shared between the threads running each restart of the ture (motif) finding methods. However, comparison simulated annealing schedule independently, and are using AUC with ProSMoS is not possible as ProSMoS usually small enough to fit in the fast shared memory. It does not rank all database structures. Instead, it returns may be possible to parallelize SHEBA in a similar man- a set of structures that contain the query motif. The ner. However, assuming a similar two-level paralleliza- web server version [54] provides a ranking within this tion, where the coarser level is to run multiple set only, the downloadable version provides no scores or comparisons independently in parallel, and the finer ranking. Stivalaetal.BMCBioinformatics2010,11:446 Page9of17 http://www.biomedcentral.com/1471-2105/11/446 Figure 5 substructure (motif) queries. d1ubia_ (all SSEs) and b-grasp motif query (darker shaded SSEs only) structures (left), and 3D structureofthecanonicalactiveserpin,a-antitrypsin,PDBid1QLP,showingtheB/Csheetusedasthesubstructurequeryasthedarkershaded 1 SSEs(right).ImagescreatedwithPyMOL[81]. Table 7 shows the results of these substructure substructure query. Hence SA Tableau Search could not queries. We can see that, compared to QP Tableau be expected to perform badly on the d1ubia_ query as Search, SA Tableau Search has similar accuracy, and is it has been in some part optimized for this query. This up to 6 times faster on the CPU implementation. The is not the case, however, for the serpin B/C sheet GPU implementation (on the GTX 285 card) is faster substructure. still, providing a speedup of up to 26 times over the CPU implementation of the same algorithm. Conclusions It should be noted that (as described in the Methods We have demonstrated a simulated annealing heuristic section), d1ubia_ is one of the queries used in tuning for tableau-based protein structure and substructure the simulated annealing parameters. For that purpose it searching, that is as fast or faster, and comparable in was evaluated as a fold recognition task, rather than as a accuracy, with some widely used existing methods when Table 7AUC andelapsed timefor somemotif(substructure) queries 95%confidenceinterval Query Method Platform Restarts Elapsedtime AUC standarderror lower upper d1ubia_ SA GTX285 128 00m03s 0.918 0.011 0.896 0.940 d1ubia_ SA CPU 128 01m17s 0.912 0.011 0.890 0.935 d1ubia_ QP CPU - 05m11s 0.902 0.012 0.879 0.926 d1ubia_ TOPS CPU - 00m10s 0.894 0.012 0.870 0.918 b-grasp SA CPU 128 01m11s 0.939 0.010 0.920 0.958 b-grasp QP CPU - 02m01s 0.938 0.010 0.918 0.957 b-grasp SA GTX285 128 00m03s 0.934 0.010 0.914 0.954 b-grasp TOPS CPU - 00m09s 0.847 0.014 0.819 0.875 serpinB/Csheet SA GTX285 128 00m03s 0.993 0.013 0.968 1.019 serpinB/Csheet SA CPU 128 01m19s 0.991 0.015 0.962 1.021 serpinB/Csheet QP CPU - 08m16s 0.986 0.019 0.949 1.023 serpinB/Csheet TOPS CPU - 00m24s 0.491 0.054 0.385 0.597 AreaundertheROCcurve(AUC)andelapsedtimeforsomemotif(substructure)queriesontheASTRALSCOP95%sequenceidentitynon-redundantdatabase. IntheMethodcolumn,QPreferstotheQPTableauSearchmethod,SAreferstoSATableauSearch.InthePlatformcolumn,eitheraGPUcardisdescribed,or CPU,whichisasinglecoreofanAMDQuadCoreOpteron(2.3GHz,32GBRAM)runningLinux.TheresultsforeachqueryaresortedbyAUCdescending. Stivalaetal.BMCBioinformatics2010,11:446 Page10of17 http://www.biomedcentral.com/1471-2105/11/446 run on a standard CPU. In addition, we have provided a parallel), and the second part as T, resulting in a tableau parallel implementation on modern GPU cards that code OT for this angle. achieves a speedup of up to 34 times over the CPU We will denote the N × N tableau for a structure with implementation, making it one of the fastest available N SSEs by T , with elements t , 1 ≤ i, j ≤ N being ij methods. To the best of our knowledge, this is the first tableau codes such as PE or OS. Because the tableau is application of GPUs to the protein structural search symmetric, only the bottom triangle is stored, and since problem. the main diagonal is redundant (the angle between an There may well be scope for application of this techni- SSE and itself), it is used to store the type of the SSE for que to other optimization problems in bioinformatics that row and column. Consider the two helices in Figure that can be approximated by relatively simple optimiza- 7: they are anti-parallel, and have the tableau code OT. tion heuristics capable of being parallelized by imple- This can be seen by finding the two helix codes (xa) on mentation on a GPU. For example, the multistart the main diagonal. The helices in this structure are the variable neighborhood search (VNS) heuristic for maxi- 2nd and 5th SSEs, so the angles relative to these two mum contact map overlap [30], or some other heuristic SSEs are in row (and column) 2 and 5 of the tableau. such as simulated annealing for the same problem, Hence the entry for the angle between these two helices could benefit from parallelization on the GPU. Another is their common row and column, which is OT. In fact, candidate problem of much current interest is biological they have an interaxial angle of approximately 143°. network alignment, a highly computationally intensive For a structure with N SSEs, the distance matrix is a problem that has previously been solved by quadratic square symmetric matrix D = (d ), 1 ≤ i,j ≤ N where ij programming [55], and has recently been approximated each element is the distance (in Ångströms) between by local search heuristics [56] amongst other methods. the centroids of the Ca atoms in SSEs i and j. As with tableaux, these distance matrices apply to SSEs, not Methods residues. Tableaux and distance matrices As originally defined by Lesk [18], and Kamat and A tableau is a discrete encoding of the orientation Lesk [57], tableaux only contain entries for SSEs that matrix for a protein structure, originally defined by Lesk are “in contact” with each other, that is, the correspond- [18]. Other work has shown that tableaux can accurately ing entry in the distance matrix is below some thresh- differentiate folds [57], and can be used for rapid pro- old. In this paper, as in Konagurthu et al. [3] and Stivala tein structural comparison [3,17] and for protein sub- et al. [19], we use a version of tableaux in which every structural search [19]. entry has a value, regardless of the distance between the The orientation matrix is a square symmetric matrix two SSEs. describing the relative orientation of secondary structure elements (SSEs) in the protein. Each element ω , 1 ≤ i, j The tableau matching problem ij ≤ N of the orientation matrix for a structure with N The problem of finding a common (maximally-similar) SSEs is the relative angle between SSEs i and j, num- substructure between two protein structures was formu- bered from the N- to the C-terminus. This matrix is lated as a quadratic integer problem (QIP) of extracting computed by fitting axes to each SSE, and, for every maximally-similar subtableaux by Konagurthu et al. [3]. pair of SSEs, computing the relative angles between We use the same formulation here, using only the dis- their axes. This interaxial angle is defined as the smallest crete tableau, not the continuous orientation matrix, angle required to reorient one axis vector so that it and, as in our previous work [19], we also incorporate a eclipses the other, around the mutual perpendicular distance matrix difference constraint (Equation 7). between the two vectors; or, equivalently, the angle Define Boolean variables x , 1 ≤ i ≤ N , 1 ≤ j ≤ N ij A B between the two vectors projected onto a plane normal where x = 1 indicates that the ith SSE in structure A is ij to their mutual perpendicular [3]. matched with the jth SSE in structure B. Let T =(tA) A ij The tableau is derived from the orientation matrix by and T =(tB) be tableaux for protein structures A and B ij means of a double-quadrant encoding scheme whereby B with N and N SSEs, respectively. Define a scoring A B the angles are classified into quadrants in two different function as: ways that differ in orientation by π/4. This prevents a small variation in angle resulting in two completely dif- ⎧ 2, if tA ≡tB ⎪ ik jl ferent encodings [18], illustrated in Figure 6. For exam- ⎪ (tA,tB)=⎨ 1, if tA tB (1) ple, consider two SSEs that are anti-parallel; they may ik jl ⎪ ik jl −2, otherwise. have an interaxial angle of, say, 143° The first part of ⎩⎩⎪ the encoding scheme gives the tableau code as O (anti-

Description:

primarily for database scanning, but also to provide a set of correspondences between residues. Since the number of residues is naturally much larger than the number of SSEs, these methods must solve problems of a larger size than SSE-based methods. SHEBA [16] and YAKUSA both use a

Fast and accurate protein substructure searching with simulated annealing and GPUs PDF

17 Pages·2010·1.16 MB·English

Checking for file health...

Save to my drive

Quick download

Download

Download Fast and accurate protein substructure searching with simulated annealing and GPUs PDF Free - Full Version

by Unknow| 2010| 17 pages| 1.16| English

Download Fast and accurate protein substructure searching with simulated annealing and GPUs by in PDF format completely FREE. No registration required, no payment needed. Get instant access to this valuable resource on PDFdrive.to!

Free Download PDF

About Fast and accurate protein substructure searching with simulated annealing and GPUs

Detailed Information

Author:	Unknown
Publication Year:	2010
Pages:	17
Language:	English
File Size:	1.16
Format:	PDF
Price:	FREE

Download Free PDF

Safe & Secure Download - No registration required

Why Choose PDFdrive for Your Free Fast and accurate protein substructure searching with simulated annealing and GPUs Download?

100% Free: No hidden fees or subscriptions required for one book every day.
No Registration: Immediate access is available without creating accounts for one book every day.
Safe and Secure: Clean downloads without malware or viruses
Multiple Formats: PDF, MOBI, Mpub,... optimized for all devices
Educational Resource: Supporting knowledge sharing and learning

Frequently Asked Questions

Is it really free to download Fast and accurate protein substructure searching with simulated annealing and GPUs PDF?

Yes, on https://PDFdrive.to you can download Fast and accurate protein substructure searching with simulated annealing and GPUs by completely free. We don't require any payment, subscription, or registration to access this PDF file. For 3 books every day.

How can I read Fast and accurate protein substructure searching with simulated annealing and GPUs on my mobile device?

After downloading Fast and accurate protein substructure searching with simulated annealing and GPUs PDF, you can open it with any PDF reader app on your phone or tablet. We recommend using Adobe Acrobat Reader, Apple Books, or Google Play Books for the best reading experience.

Is this the full version of Fast and accurate protein substructure searching with simulated annealing and GPUs?

Yes, this is the complete PDF version of Fast and accurate protein substructure searching with simulated annealing and GPUs by Unknow. You will be able to read the entire content as in the printed version without missing any pages.

Is it legal to download Fast and accurate protein substructure searching with simulated annealing and GPUs PDF for free?

https://PDFdrive.to provides links to free educational resources available online. We do not store any files on our servers. Please be aware of copyright laws in your country before downloading.

The materials shared are intended for research, educational, and personal use in accordance with fair use principles.