Program of RECOMB-seq 2024

 

Saturday, April 27, 2024 (Tentative Schedule)

Session Time Title Author(s)
Breakfast 9:00 AM    
Keynote (Chaired by Mingfu Shao) 9:30 AM Challenges in de novo sequence assembly Heng Li
Coffee Break 10:30 AM    
Proceedings (Chaired by Can Alkan)
10:45 AM Movi: a fast and cache-efficient full-text pangenome index Mohsen Zakeri, Nathaniel Brown, Omar Ahmed, Travis Gagie and Ben Langmead
11:10 AM Building a Pangenome Alignment Index via Recursive Prefix-Free Parsing Eddie Ferro, Marco Oliva, Travis Gagie and Christina Boucher
11:35 AM The Backpack Quotient Filter: a dynamic and space-efficient data structure for querying k-mers with abundance Victor Levallois, Pierre Peterlongo, Yoann Dufresne, Bertrand Le Gal and Francesco Andreace
Lunch break 12:00 PM    
Short-Talks I (Chaired by Giulio E. Pibiri)
1:30 PM Compressed Indexing for Pangenome Substring Queries  Stephen Hwang, Nathaniel K. Brown, Omar Y. Ahmed, Katharine Jenike, Sam Kovaka, Michael C. Schatz and Ben Langmead
1:40 PM Pan-genome de Bruijn Graph using the Bidirectional FM-index Lore Depuydt, Luca Renders, Thomas Abeel and Jan Fostier
1:50 PM Mumemto: efficient maximal matching across multiple genomes Vikram Shivakumar and Ben Langmead
2:00 PM A*PA & A*PA2: Up to 20 times faster exact global pairwise alignment Ragnar Groot Koerkamp and Pesho Ivanov
Break 2:10 PM    
Short-Talks II (Chaired by Stefan Canzar)
2:30 PM Accelerating whole-genome alignment using parallel chaining algorithm Ghanshyam Chandra and Chirag Jain
2:40 PM Full resolution HLA and KIR genes annotation for human genome assemblies Ying Zhou, Li Song and Heng Li
2:50 PM Combining DNA and protein alignments to improve genome annotation with LiftOn Kuan-Hao Chao, Jakob M Heinz, Celine Hoh, Alan Mao, Alaina Shumate, Mihaela Pertea and Steven L Salzberg
3:00 PM DupCaller enables robust detection of somatic mutations from Error-Corrected Sequencing Yuhe Cheng and Ludmil B Alexandrov
Posters 3:30 PM (see list below)  
Overlay (Chaired by Chirag Jain)
4:30 PM PLA-complexity of k-mer multisets Md Hasin Abrar and Paul Medvedev
4:55 PM Phasing Diploid Genome Assembly Graphs with Single-Cell Strand Sequencing Mir Henglin, Maryam Ghareghani, William Harvey, David Porubsky, Sergey Koren, Evan Eichler, Peter Ebert and Tobias Marschall
5:20 PM CNVeil enables accurate and robust tumor subclone identification and copy number estimation from single cell sequencing data Weiman Yuan, Can Luo, Yunfei Hu, Liting Zhang, Zihang Wen, Yichen Henry Liu, Xian Mallory and Xin Maizie Zhou
Closing 5:45 PM    
Posters to be presented on Saturday April 27, 2024
Posters corresponding to Saturday short-talks are invited to present on Saturday.
  • Alexis Garretson and Beth L. Dumont. Genetic Architecture of the Germline Mutation Rate and Reproductive Success in the Collaborative Cross
  • Estefania Torrejón, Akiko Teshima, Inês Ferreira, Ana Sofia Carvalho, Hans Christian Beck, Rune Matthiesen, Fabrizia Carli, Amalia Gastaldelli, Maria Paula Macedo and Rita Machado de Oliveira. Gut derived Extracellular Vesicles Reaching Kupffer Cells: An alternative route for lipid transport out of the gut
  • Michelle Wei and Guang Yao. QDSWorkflow: An Elastic Net-Based Tool for Modeling Cellular Dormancy
  • Thi Duyen Do, Ming-Ren Yang and Yu-Wei Wu. A Unitig-Centered Pan-Genome Approach for Predicting Antibiotic Resistance and Discovering Novel Resistance Genes in Bacterial Strains
  • Grigore Boldirev, Nitesh Sharma, Alex Zelikovsky and Serghei Mangul. Assessing Microbial Genome Representation Across Various Reference Databases: A Comprehensive Evaluation
  • Siqin Li, Nicole Genesis Nicole Carpio Paucar and Natalie Farny. Validating a liquid soil model to explore soil microbial community dynamics
  • Ndivhuwo Nyase, Lebohang Mashatola, Stephanie Muller, Aviwe Kohlakala and Kahn Rhrissorrakrai. WGTDA: A Topological Framework for Biomarker Discovery in Gene Expression Data
  • Ragnar Groot Koerkamp and Daniel Liu. PA-Bench: A framework for benchmarking pairwise aligners
  • Daniel Liu and Martin Steinegger. Block Aligner: an adaptive SIMD-accelerated aligner for sequences and position-specific scoring matrices

 

Sunday, April 28, 2024 (Tentative Schedule)

Session Sunday Title Author(s)
Breakfast 9:00 AM    
Keynote (Chaired by Ritambhara Singh) 9:30 AM A new view on sequence alignments: from interactions to language model Mona Singh
Coffee Break 10:30 AM    
Proceedings (Chaired by Lucy Williams)
10:45 AM Synthetic promoter design in Escherichia coli based on multinomial diffusion model Qixiu Du, May Nee Poon, Xiaocheng Zeng, Pengcheng Zhang, Zheng Wei, Haochen Wang, Ye Wang, Lei Wei and Xiaowo Wang
11:10 AM Quick and effective approximation of in silico saturation mutagenesis experiments with first-order Taylor expansion Alexander Sasse, Sara Mostafavi and Maria Chikina
11:35 AM Flowtigs: safety in flow decompositions for assembly graphs Francisco Sena, Eliel Ingervo, Shahbaz Khan, Andrey Prjibelski, Sebastian Schmidt and Alexandru Tomescu
Lunch break 12:00 PM    
Short-Talks I (Chaired by Baraa Orabi)
1:30 PM Comprehensive Tissue-Specific Somatic Mutation Profiling via RNA-seq in Diverse Mice Alexis Garretson and Beth L Dumont
1:40 PM VISTA: An integrated framework for structural variant discovery Varuni Sarwal, Seungmo Lee, Jianzhi Yang, Sriram Sankararaman, Mark Chaisson, Eleazar Eskin and Serghei Mangul
1:50 PM An efficient and accurate germline SNP caller for long-read RNA sequencing data Neng Huang and Heng Li
2:00 PM Analyzing the relatedness of genomic variation in malaria parasites using a reference-free approach Cecile P G Meier-Scherling, Tavor Baharav, Karamoko Niaré, Julia Salzman, Lorin Crawford and Jeffrey A Bailey
Break 2:10 PM    
Short-Talks II (Chaired by Li Song)
2:30 PM Identification of B cell subsets based on antigen receptor sequences using deep learning Hyunho Lee, Kyoungseob Shin, Yongju Lee, Soobin Lee, Seungyoun Lee, Eunjae Lee, Seung Woo Kim, Ha Young Shin, Jong Hoon Kim, Junho Chung and Sunghoon Kwon
(canceled) Comprehensive benchmarking of methods to infer from TCR-Seq data Mohammad Vahed, Yu Ning Huang, Jiaqi Fu, Kerui Peng and Serghei Mangul
2:40 PM Optimizing Design of Genomics Studies for Clonal Evolution Analysis Arjun Srivatsa and Russell Schwartz
Posters 3:30 PM (see list below)  
Overlay (Chaired by Askar Gafurov)
4:30 PM Quality assessment of splice site annotation based on conservation across multiple species Ilia Minkin and Steven Salzberg
4:55 PM MaskGraphene: Advancing joint embedding, clustering, and batch correction for spatial transcriptomics using graph-based self-supervised learning Yunfei Hu, Yikang Li, Manfei Xie, Mingxing Rao, Yichen Henry Liu and Xin Maizie Zhou
5:20 PM Investigating the performance of foundation models on human 3’UTR sequences Sergey Vilov and Matthias Heinig
Closing 5:45 PM    
Posters to be presented on Sunday April 28, 2024
Posters corresponding to Sunday short-talks are invited to present on Sunday.
  • Yunzhe Jiang, Beatrice Borsari and Mark Gerstein. Comprehensive characterization of pseudogenes across 26 human tissues
  • Maria Chikina and Tugrul Balci. A unified hypothesis-free feature extraction framework for diverse epigenomic data
  • Karishma Chhugani, Serghei Mangul, Oleg Arnaut, Nitesh Sharma and Belin Korukoğlu. Benchmarking of machine learning algorithms to predict mortality in sepsis from transcriptomic data
  • Dhrithi Deshpande and Serghei Mangul. Comparative Evaluation of T-Cell Receptor Repertoire Sequencing Methods
  • Muhammet Celik, Felix Roman Salcher, Frank Ziemann, Maria Solovey and Maria Colome-Tatche. Community: A Novel R-Tool for Enhanced Differential Communication Analysis in scRNAseq Data
  • Pratik Worah. Recovering approximate single cell distribution from aggregate measurements
  • Dallas Nygard, Julie St-Pierre and Mathieu Lavallée-Adam. Statistical assessment of gene functional annotation clustering in graph models of chromosome conformation capture data
  • Timothy Barry, Joseph Deutch, Xihong Lin and Eugene Katsevich. sceptre: statistically rigorous, computationally efficient, and user-friendly single-cell CRISPR screen data analysis
  • Amanda Clare, Wayne Aubrey, Mike Surette and Nicholas Dimonaco. Partial gene predictions on unassembled reads: evaluating the Good, the Bad and the slightly ORF
  • Parishad Mokhber, Alireza Gargoorimotlagh and Babak Khalaj. scFedVI: A Privacy-Preserving Approach to Mitigating Batch Effects in Single-Cell RNA-Sequencing Data

 

 

 

 

Accepted Papers to the Proceedings Track

  • Qixiu Du, May Nee Poon, Xiaocheng Zeng, Pengcheng Zhang, Zheng Wei, Haochen Wang, Ye Wang, Lei Wei and Xiaowo Wang. Synthetic promoter design in Escherichia coli based on multinomial diffusion model.
  • Mohsen Zakeri, Nathaniel Brown, Omar Ahmed, Travis Gagie and Ben Langmead. Movi: a fast and cache-efficient full-text pangenome index.
  • Eddie Ferro, Marco Oliva, Travis Gagie and Christina Boucher. Building a Pangenome Alignment Index via Recursive Prefix-Free Parsing.
  • Francisco Sena, Eliel Ingervo, Shahbaz Khan, Andrey Prjibelski, Sebastian Schmidt and Alexandru Tomescu. Flowtigs: safety in flow decompositions for assembly graphs.
  • Victor Levallois, Pierre Peterlongo, Yoann Dufresne, Bertrand Le Gal and Francesco Andreace. The Backpack Quotient Filter: a dynamic and space-efficient data structure for querying k-mers with abundance.
  • Alexander Sasse, Sara Mostafavi and Maria Chikina. Quick and effective approximation of in silico saturation mutagenesis experiments with first-order Taylor expansion.

Accepted Papers to the Overlay Track

  • Ilia Minkin and Steven Salzberg. Quality assessment of splice site annotation based on conservation across multiple species.
  • Yunfei Hu, Yikang Li, Manfei Xie, Mingxing Rao, Yichen Henry Liu and Xin Maizie Zhou. MaskGraphene: Advancing joint embedding, clustering, and batch correction for spatial transcriptomics using graph-based self-supervised learning.
  • Tim Dunn, Justin Zook, James Holt and Satish Narayanasamy. Jointly benchmarking small and structural variant calls with vcfdist.
  • Md Hasin Abrar and Paul Medvedev. PLA-complexity of k-mer multisets.
  • Mir Henglin, Maryam Ghareghani, William Harvey, David Porubsky, Sergey Koren, Evan Eichler, Peter Ebert and Tobias Marschall. Phasing Diploid Genome Assembly Graphs with Single-Cell Strand Sequencing.
  • Sergey Vilov and Matthias Heinig. Investigating the performance of foundation models on human 3’UTR sequences.
  • Weiman Yuan, Can Luo, Yunfei Hu, Liting Zhang, Zihang Wen, Yichen Henry Liu, Xian Mallory and Xin Maizie Zhou. CNVeil enables accurate and robust tumor subclone identification and copy number estimation from single cell sequencing data.

Accepted Short Talks

  • Mohammad Vahed, Yu Ning Huang, Jiaqi Fu, Kerui Peng and Serghei Mangul. Comprehensive benchmarking of methods to infer from TCR-Seq data.
  • Alexis Garretson and Beth L. Dumont. Comprehensive Tissue-Specific Somatic Mutation Profiling via RNA-seq in Diverse Mice.
  • Yuhe Cheng and Ludmil B. Alexandrov. DupCaller enables robust detection of somatic mutations from Error-Corrected Sequencing.
  • Li Su, Duolin Wang, Fei He and Dong Xu. Integrating scRNA-seq and scTCR-seq data to identify antigen-reactive T cells in T cell repertoire.
  • Ying Zhou, Li Song and Heng Li. Full resolution HLA and KIR genes annota)on for human genome assemblies.
  • Neng Huang and Heng Li. An efficient and accurate germline SNP caller for long-read RNA sequencing data.
  • Hyunho Lee, Kyoungseob Shin, Yongju Lee, Soobin Lee, Seungyoun Lee, Eunjae Lee, Seung Woo Kim, Ha Young Shin, Jong Hoon Kim, Junho Chung and Sunghoon Kwon. Identification of B cell subsets based on antigen receptor sequences using deep learning.
  • Arjun Srivatsa and Russell Schwartz. Optimizing Design of Genomics Studies for Clonal Evolution Analysis.
  • Kuan-Hao Chao, Jakob M. Heinz, Celine Hoh, Alan Mao, Alaina Shumate, Mihaela Pertea and Steven L Salzberg. Combining DNA and protein alignments to improve genome annotation with LiftOn.
  • Lore Depuydt, Luca Renders, Thomas Abeel and Jan Fostier. Pan-genome de Bruijn Graph using the Bidirectional FM-index.
  • Ghanshyam Chandra and Chirag Jain. Accelerating whole-genome alignment using parallel chaining algorithm.
  • Vikram Shivakumar and Ben Langmead. Mumemto: efficient maximal matching across multiple genomes.
  • Stephen Hwang, Nathaniel K. Brown, Omar Y. Ahmed, Katharine Jenike, Sam Kovaka, Michael C. Schatz and Ben Langmead. Compressed Indexing for Pangenome Substring Queries.
  • Varuni Sarwal, Seungmo Lee, Jianzhi Yang, Sriram Sankararaman, Mark Chaisson, Eleazar Eskin and Serghei Mangul. VISTA: An integrated framework for structural variant discovery.
  • Cecile P G Meier-Scherling, Tavor Baharav, Karamoko Niaré, Julia Salzman, Lorin Crawford and Jeffrey A Bailey. Analyzing the relatedness of genomic variation in malaria parasites using a reference-free approach.
  • Ragnar Groot Koerkamp and Pesho Ivanov. A*PA & A*PA2: Up to 20 times faster exact global pairwise alignment.

Accepted Posters

(NOTE: authors of accepted short-talks are also invited to present a poster.)
  • Alexis Garretson and Beth L. Dumont. Genetic Architecture of the Germline Mutation Rate and Reproductive Success in the Collaborative Cross
  • Estefania Torrejón, Akiko Teshima, Inês Ferreira, Ana Sofia Carvalho, Hans Christian Beck, Rune Matthiesen, Fabrizia Carli, Amalia Gastaldelli, Maria Paula Macedo and Rita Machado de Oliveira. Gut derived Extracellular Vesicles Reaching Kupffer Cells: An alternative route for lipid transport out of the gut
  • Yunzhe Jiang, Beatrice Borsari and Mark Gerstein. Comprehensive characterization of pseudogenes across 26 human tissues
  • Maria Chikina and Tugrul Balci. A unified hypothesis-free feature extraction framework for diverse epigenomic data
  • Michelle Wei and Guang Yao. QDSWorkflow: An Elastic Net-Based Tool for Modeling Cellular Dormancy
  • Thi Duyen Do, Ming-Ren Yang and Yu-Wei Wu. A Unitig-Centered Pan-Genome Approach for Predicting Antibiotic Resistance and Discovering Novel Resistance Genes in Bacterial Strains
  • Karishma Chhugani, Serghei Mangul, Oleg Arnaut, Nitesh Sharma and Belin Korukoğlu. Benchmarking of machine learning algorithms to predict mortality in sepsis from transcriptomic data
  • Grigore Boldirev, Nitesh Sharma, Alex Zelikovsky and Serghei Mangul. Assessing Microbial Genome Representation Across Various Reference Databases: A Comprehensive Evaluation
  • Siqin Li, Nicole Genesis Nicole Carpio Paucar and Natalie Farny. Validating a liquid soil model to explore soil microbial community dynamics
  • Dhrithi Deshpande and Serghei Mangul. Comparative Evaluation of T-Cell Receptor Repertoire Sequencing Methods
  • Muhammet Celik, Felix Roman Salcher, Frank Ziemann, Maria Solovey and Maria Colome-Tatche. Community: A Novel R-Tool for Enhanced Differential Communication Analysis in scRNAseq Data
  • Pratik Worah. Recovering approximate single cell distribution from aggregate measurements
  • Dallas Nygard, Julie St-Pierre and Mathieu Lavallée-Adam. Statistical assessment of gene functional annotation clustering in graph models of chromosome conformation capture data
  • Ndivhuwo Nyase, Lebohang Mashatola, Stephanie Muller, Aviwe Kohlakala and Kahn Rhrissorrakrai. WGTDA: A Topological Framework for Biomarker Discovery in Gene Expression Data
  • Timothy Barry, Joseph Deutch, Xihong Lin and Eugene Katsevich. sceptre: statistically rigorous, computationally efficient, and user-friendly single-cell CRISPR screen data analysis
  • Amanda Clare, Wayne Aubrey, Mike Surette and Nicholas Dimonaco. Partial gene predictions on unassembled reads: evaluating the Good, the Bad and the slightly ORF
  • Ragnar Groot Koerkamp and Daniel Liu. PA-Bench: A framework for benchmarking pairwise aligners
  • Daniel Liu and Martin Steinegger. Block Aligner: an adaptive SIMD-accelerated aligner for sequences and position-specific scoring matrices
  • Parishad Mokhber, Alireza Gargoorimotlagh and Babak Khalaj. scFedVI: A Privacy-Preserving Approach to Mitigating Batch Effects in Single-Cell RNA-Sequencing Data