|
研究方向:
|
主持项目:
国家自然科学基金杰出青年基金 | 生物序列计算分析 | |
国家自然科学基金原创探索项目延续资助 | 从支持生物序列机到生物序列神经网络 | |
国家自然科学基金重点项目 | 面向单细胞RNA测序数据的深度迁移模型与细胞通信网络研究 | |
国家自然科学基金原创探索项目 | 支持生物序列机(已结题) | |
国家自然科学基金优秀青年基金 | 生物信息处理与分析 (已结题) | |
四川省杰出青年科技人才项目 | 大规模生物序列分类和聚类方法研究(已结题) | |
国家自然科学基金面上项目 | 利用多序列比对指导16s rRNA的OTU聚类 (已结题) | |
国家自然科学基金面上项目 | 基于MapReduce的非编码RNA“从头预测”识别方法研究 (已结题) | |
国家自然科学基金青年基金 | 基于投票机制的非编码RNA“从头预测”识别方法研究 (已结题) | |
福建省自然科学基金面上项目 | 转录组数据中的microRNA和SNP挖掘方法研究 (已结题) |
学术兼职:
出版著作:
特约报告:
代表论文:
- Retrosynthesis prediction with an interpretable deep-learning framework based on molecular assembly tasks. Nature Communications. 2023, 14: 6155 (codes)
- Inferring gene regulatory network from single-cell transcriptomes with graph autoencoder model. PLoS Genetics. 2023, 19(9): e1010942 (codes)
- A computational model of circRNA-associated diseases based on a graph neural network: prediction, and case studies for follow-up experimental validation. BMC Biology. 2024, 22:24. (codes)
- Prediction of protein solubility based on sequence physicochemical patterns and distributed representation information with DeepSoluE. BMC Biology. 2023, 21:12. (web server)
- m5U-SVM: Identification of RNA 5-methyluridine modification sites based on multi-view features of physicochemical features and distributed representation. BMC Biology. 2023, 21: 93(web server)
- MRMD3.0: A Python tool and webserver for dimensionality reduction and data visualization via an ensemble strategy. Journal of Molecular Biology. 2023, 435: 168116 (web site)
- HAlign 3: fast multiple alignment of ultra-large numbers of similar DNA/RNA sequences. Molecular Biology and Evolution. 2022,39(8):msac166. (codes)
- TPMA: a two pointers meta-alignment tool to ensemble different multiple nucleic acid sequence alignments. PLoS Computational Biology. 2024, 20(4): e1011988 (codes)
- AutoEdge-CCP: A Novel Approach for Predicting Cancer-Associated CircRNAs and Drugs based on Automated Edge Embedding. PLoS Computational Biology. 2024, 20(1): e1011851 (codes)
- A multi-label learning model for predicting drug-induced pathology in multi-organ based on toxicogenomics data. PLoS Computational Biology. 2022, 18(9): e1010402 (codes)
- A comprehensive overview and evaluation of circular RNA detection tools. PLoS Computational Biology. 2017,13(6): e1005420 (SCI, IF2017=3.955, PMID: 28594838) (data and codes)(BibTeX, EndNote)
- Gene2vec: Gene Subsequence Embedding for Prediction of Mammalian N6‐Methyladenosine Sites from mRNA. RNA. 2019, 25(2): 205-218 (SCI, IF2017=4.490, PMID: 30425123) (web server)(BibTeX, EndNote)
- RAVAR: a curated repository for rare variant-trait associations. Nucleic Acids Research. 2024, 52(D1): D990-D997. (web server)
- webTWAS: a resource for disease candidate susceptibility genes identified by transcriptome-wide association study. Nucleic Acids Research. 2022, 50(D1): D1123-D1130. (web site)
- DeepBIO: An automated and interpretable deep-learning platform for high-throughput biological sequence prediction, functional annotation, and visualization analysis. Nucleic Acids Research. 2023, 51(7): 3017-3029. (web server)
- PaGeFinder: Quantitative Identification of Spatiotemporal Pattern Genes. Bioinformatics. 2012, 28(11):1544-1545. (SCI, IF2017=5.481, PMID:22492640)(Web Server)(BibTeX, EndNote)
- HAlign: Fast Multiple Similar DNA/RNA Sequence Alignment Based on the Centre Star Strategy. Bioinformatics. 2015,31(15): 2475-2481. (SCI, IF2017=5.481, PMID: 25812743) (Software)(该软件被OMICTOOLS推荐)(BibTeX, EndNote)
- Tumor Origin Detection with Tissue-Specific miRNA and DNA methylation Markers. Bioinformatics. 2018, 34(3): 398-406. (SCI, IF2017=5.481, PMID:29028927) (web server) (BibTeX, EndNote)
- O-GlcNAcPRED-II: an integrated classification algorithm for identifying O-GlcNAcylation sites based on fuzzy undersampling and a K-means PCA oversampling technique. Bioinformatics. 2018, 34(12): 2029-2036. (SCI, IF2017=5.481, PMID:29420699)(web server)(该软件被OMICTOOLS推荐)(BibTeX, EndNote)
- Prediction of potential disease-associated microRNAs using structural perturbation method. Bioinformatics. 2018, 34(14): 2425-2432.(SCI, IF2017=5.481, PMID:29490018)(codes)(BibTeX, EndNote)
- 4mCPred: Machine Learning Methods for DNA N4-methylcytosine sites Prediction. Bioinformatics. 2019, 35(4): 593-601 (SCI, IF2017=5.481, PMID: 30052767)(web server)(该软件被OMICTOOLS推荐)
- Exploring sequence-based features for the improved prediction of DNA N4-methylcytosine sites in multiple species. Bioinformatics. 2019, 35(8): 1326-1333. (SCI, IF2017=5.481, PMID: 30239627)(web server)
- PEPred-Suite: improved and robust prediction of therapeutic peptides using adaptive feature representation learning. Bioinformatics. 2019, 35(21):4272-4280. (SCI, IF2017=5.481, PMID:30994882) (web server)
- Iterative feature representations improve N4-methylcytosine site prediction. Bioinformatics. 2019, 35(23): 4930-4937. (SCI, IF2017=5.481, PMID:31099381)(web server)
- StackCPPred: A Stacking and Pairwise Energy Content based Prediction of Cell-Penetrating Peptides and Their Uptake Efficiency. Bioinformatics. 2020, 36(10):3028-3034. (code)
- PPTPP: A novel therapeutic peptide prediction method using physicochemical property encoding and adaptive feature representation learning. Bioinformatics. 2020, 36(13): 3982-3987 (codes and data)
- Basic polar and hydrophobic properties are the main characteristics that affect the binding of transcription factors to methylation sites. Bioinformatics. 2020,36(15):4263-4268 (Supplementary data)
- Identification of Sub-Golgi Protein Localization by Use of Deep Representation Learning Features. Bioinformatics. 2020, 36(24):5600-5609 (web server)
- BP4RNAseq: a babysitter package for retrospective and newly generated RNA-seq data analyses using both alignment-based and alignment-free quantification methods. Bioinformatics. 2021, 37(9):1319-1321. (codes) High impact research from Bioinformatics
- DeepAc4C: A convolutional neural network model with hybrid features composed of physico-chemical patterns and distributed representation information for identification of N4 acetylcytidine in mRNA. Bioinformatics. 2022, 38(1): 52-57. (web server)
- NerLTR-DTA: Drug-target binding affinity prediction based on neighbor relationship and learning to rank. Bioinformatics. 2022, 38(7): 1964-1971. (codes and datasets)
- GMNN2CD: Identification of circRNA–disease associations based on variational inference and graph Markov neural networks. Bioinformatics. 2022,38(8):2246-2253. (codes and data)
- webSCST: an interactive web application for single-cell RNA-sequencing data and spatial transcriptomic data integration. Bioinformatics. 2022, 38(13):3488-3489. (web server)
- Effector-GAN: prediction of fungal effector proteins based on pretrained deep representation learning methods and generative adversarial networks. Bioinformatics. 2022, 38(14):3541-3548. (web server)
- i6mA-Caps: A CapsuleNet-based framework for identifying DNA N6-methyladenine sites. Bioinformatics. 2022, 38(16): 3885-3891. (web server)
- CircSI-SSL: circRNA-binding site identification based on self-supervised learning. Bioinformatics. 2024, 40(1): btae004. (codes)
- FMAlign2: A novel fast multiple nucleotide sequence alignment method for ultralong datasets. Bioinformatics. 2024, 40(1): btae014(codes)
- Revisiting Drug-Protein Interaction Prediction: A Novel Global-Local Perspective. Bioinformatics. 2024, 40(5): btae271. (codes)
- scTPC: a novel semi-supervised deep clustering model for scRNA-seq data. Bioinformatics. 2024, 40(5): btae293 (codes)
- DeepAVP-TPPred: identification of antiviral peptides using transformed image-based localized descriptors and binary tree growth algorithm. Bioinformatics. 2024, 40(5): btae305 (codes)
- Drug-target interaction predictions with multi-view similarity network fusion strategy and deep interactive attention mechanism. Bioinformatics. 2024, 40(6): btae346. (codes)
- GraphADT: Empowering Interpretable Predictions of Acute Dermal Toxicity with Multi-View Graph Pooling and Structure Remapping. Bioinformatics. 2024, 40(7): btae438. (codes)
- Sequence clustering in bioinformatics: an empirical study. Briefings in Bioinformatics. 2020,21(1): 1-10 (SCI, IF2017=6.302, PMID: 30239587)(data) Highly Cited Articles from Briefings in Bioinformatics
- Prediction of bio-sequence modifications and the associations with diseases. Briefings in Functional Genomics. 2021, 20(1): 1-18 (data)Editor's Choice Highly Cited Articles from Briefings in Functional Genomics
- CPPred-RF: a sequence-based predictor for identifying cell-penetrating peptides and their uptake efficiency. Journal of Proteome Research.2017, 16(5):2044-2053. (SCI, IF2018=3.780)(web server)(入选ACS出版社庆祝中科院建院70周年Highlight文章)
- Prediction of human protein subcellular localization using deep learning. Journal of Parallel and Distributed Computing. 2018, 117: 212-217 (SCI, IF2017=1.815) most cited articles in this journal (BibTeX, EndNote)
- Review and comparative analysis of machine learning-based phage virion protein identification methods. BBA - Proteins and Proteomics. 2020,1868(6):140406 Featured in the BBA Collection on "Viruses" and "Artificial Intelligence for biochemistry and molecular biology: AI is learning to help"
- Multiple Sequence Alignment Based on a Suffix Tree and Center-Star Strategy: A Linear Method for Multiple Nucleotide Sequence Alignment on Spark Parallel Framework. Journal of Computational Biology. 2017, 24(12): 1230-1242 (SCI, IF2017=1.191, PMID: 29116822) (codes)(该软件被OMICTOOLS推荐)(BibTeX, EndNote)
所获奖励:
培养学生:
联系方式:
最后修改时间:2024.8.23
蜀ICP备19040033号