Conference
-
PMKLC: Parallel Multi-Knowledge Learning-based Lossless Compression for Large-Scale Genomics Database
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2025
Hui Sun, Yanfeng Ding, Liping Yi, Huidong Ma, Gang Wang, Xiaoguang Liu, Cheng Zhong, Wentong Cai
-
MSDZip: Universal Lossless Compression for Multi-source Data via Stepwise-parallel and Learning-based Prediction
The Web Conference (WWW), 2025
Huidong Ma†, Hui Sun†, Liping Yi, Yanfeng Ding, Xiaoguang Liu, Gang Wang
-
Multi-source Data Lossless Compression via Parallel Expansion Mapping and xLSTM
International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Huidong Ma, Hui Sun, Liping Yi, Xiaoguang Liu, Gang Wang
-
Adaptive Lossless Compression for Genomics Data by Multiple (s, k)-mer Encoding and XLSTM
International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Hui Sun†, Yanfeng Ding†, Liping Yi, Huidong Ma, Haonan Xie, Gang Wang, Xiaoguang Liu
-
Genomics Data Lossless Compression with (s,k)-mer Encoding and Deep Neural Networks
Annual AAAI Conference on Artificial Intelligence (AAAI), 2025
Hui Sun†, Liping Yi†, Huidong Ma, Yongxia Sun, Yingfeng Zheng, Wenwen Cui, Meng Yan, Xiaoguang Liu, Gang Wang
-
LRCB: A Comprehensive Benchmark Evaluation of Reference-free Lossless Compression Tools for Genomics Sequencing Long Reads Data
Data Compression Conference (DCC), 2024
Hui Sun†, Huidong Ma†, Yingfeng Zheng, Haonan Xie, Meng Yan, Cheng Zhong, Xiaoguang Liu, Gang Wang
-
SR2C: A Structurally Redundant Short Reads Collapser for Optimizing DNA Data Compression
International Conference on Parallel and Distributed Systems (ICPADS), 2023
Hui Sun†, Huidong Ma†, Yingfeng Zheng, Haonan Xie, Xiaofei Wang, Xiaoguang Liu, Gang Wang
-
ricME: Long-Read Based Mobile Element Variant Detection Using Sequence Realignment and Identity Calculation
International Symposium on Bioinformatics Research and Applications (ISBRA), 2023
Huidong Ma, Cheng Zhong, Hui Sun, Danyang Chen, Haixiang Lin
Journal
-
A Survey and Benchmark Evaluation for Neural-Network-Based Lossless Universal Compressors Toward Multi-Source Data
Frontiers of Computer Science (FCS), 2025
Hui Sun†, Huidong Ma†, Feng Ling, Haonan Xie, Yongxia Sun, Liping Yi, Meng Yan, Cheng Zhong, Xiaoguang Liu, Gang Wang
-
PQSDC: a parallel lossless compressor for quality scores data via sequences partition and Run-Length prediction mapping
Bioinformatics, 2024
Hui Sun, Yingfeng Zheng, Haonan Xie, Huidong Ma, Cheng Zhong, Meng Yan, Xiaoguang Liu, Gang Wang
-
PMFFRC: a large-scale genomic short reads compression optimizer via memory modeling and redundant clustering
BMC Bioinformatics, 2023
Hui Sun, Yingfeng Zheng, Haonan Xie, Huidong Ma, Xiaoguang Liu, Gang Wang
-
cnnLSV: detecting structural variants by encoding long-read alignment information and convolutional neural network
BMC Bioinformatics, 2023
Huidong Ma, Cheng Zhong, Danyang Chen, Haofa He, Feng Yang
Under Review
-
Lossless Genomic Data Storage and Sharing with Sensitive Masking and Learning-based Compression
Conference on Neural Information Processing Systems (NeurIPS), 2025
Sun Hui†, Huidong Ma†, Meng Yan, Yingfeng Zheng, Cheng Zhong, Gang Wang, Xiaoguang Liu, Wentong Cai
-
A Comprehensive Review and Benchmark of Lossless Compression for Large Language Models
Conference on Neural Information Processing Systems (NeurIPS), 2025
Sun Hui†, Jiashun Chen†, Huidong Ma, Haonan Xie, Cheng Zhong, Gang Wang, Xiaoguang Liu, Wentong Cai