A Systematic Review - Application of Variants of Rough Sets for Gene Expression Analysis

Authors

  • Atti Manga Devi Assistant Professor, Department of Information Technology, Pragati Engineering College (A), Surampalem, Andhra Pradesh, India
  • Yamuna Mundru Assistant Professor, Department of CSE-AI & ML, Pragati Engineering College (A) Surampalem, Andhra Pradesh, India
  • Manas Kumar Yogi Assistant Professor, Department of Computer Science & Engineering, Pragati Engineering College (A) Surampalem, Andhra Pradesh, India

Keywords:

Dimensionality reduction, Fuzzy-rough sets, Gene expression analysis, Genomic data, RNA sequencing

Abstract

Gene expression analysis plays a pivotal role in understanding disease mechanisms and identifying therapeutic targets, yet its high-dimensional, noisy nature poses significant analytical challenges. Rough Set Theory (RST) and its variants including fuzzy-rough sets, neighborhood rough sets, and hybrid models offer powerful solutions by enabling robust feature selection, classification, and uncertainty management in genomic data. This review explores the application of RST in gene expression analysis, highlighting its advantages over traditional methods like PCA and SVMs, such as superior interpretability, noise tolerance, and the ability to handle missing data. We discuss key methodologies, from reduct-based biomarker discovery to rule-based classification, and present case studies demonstrating their success in cancer subtype prediction and multi-omics integration. Comparative analyses reveal that rough set variants outperform classical approaches in scenarios requiring granular data handling, such as single-cell RNA sequencing and dynamic datasets. Emerging trends, including deep rough neural networks and Explainable AI (XAI) frameworks, are also examined, showcasing their potential to bridge the gap between interpretability and predictive power in precision medicine. Despite challenges in scalability and integration with deep learning, rough sets remain indispensable for extracting biologically meaningful insights from complex genomic data. This review underscores their transformative potential in bioinformatics, advocating for continued innovation to address the evolving demands of high-throughput genomics and personalized healthcare. 

References

W. Ji, Y. Pang, X. Jia, Z. Wang, F. Hou, B. Song, M. Liu, and R. Wang, "Fuzzy rough sets and fuzzy rough neural networks for feature selection: A review," Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, vol. 11, no. 3, p. e1402, May 2021, doi: https://doi.org/10.1002/widm.1402

W. Xu and Q. Bu, "Matrix-based incremental feature selection method using weight-partitioned multigranulation rough set," Information Sciences, vol. 681, pp. 121219, Oct. 1, 2024, doi: https://doi.org/10.1016/j.ins.2024.121219

M. Hu, E. C. Tsang, Y. Guo, D. Chen, and W. Xu, "Attribute reduction based on overlap degree and k-nearest-neighbor rough sets in decision information systems," Information Sciences, vol. 584, pp. 301–324, Jan. 2022, doi: https://doi.org/10.1016/j.ins.2021.10.063

B. M. R. Bagus and S. Hanaoka, “The central tendency of the seaport-fulcrum supply chain risk in Indonesia using a rough set,” The Asian Journal of Shipping and Logistics, vol. 38, no. 4, pp. 222–233, Dec. 2022, doi: https://doi.org/10.1016/j.ajsl.2022.08.003

M. Ilyas, K. M. Aamir, S. Manzoor, and M. Deriche, "Linear programming based computational technique for leukemia classification using gene expression profile," PLoS One, vol. 18, no. 10, p. e0292172, Oct. 2023, doi: https://doi.org/10.1371/journal.pone.0292172

Y. Yu, M. Wan, J. Qian, D. Miao, Z. Zhang, and P. Zhao, "Feature selection for multi-label learning based on variable-degree multi-granulation decision-theoretic rough sets," International Journal of Approximate Reasoning, vol. 169, p. 109181, Jun. 2024, doi: https://doi.org/10.1016/j.ijar.2024.109181

R. Molania, M. Foroutan, J. A. Gagnon-Bartsch, L. C. Gandolfo, A. Jain, A. Sinha, G. Olshansky, A. Dobrovic, A. T. Papenfuss, and T. P. Speed, “Removing unwanted variation from large-scale RNA sequencing data with PRPS,” Nature Biotechnology, vol. 41, no. 1, pp. 82–95, Jan. 2023, doi: https://doi.org/10.1038/s41587-022-01440-w

D. Li, H. Zhang, T. Li, A. Bouras, X. Yu, and T. Wang, "Hybrid missing value imputation algorithms using fuzzy c-means and vaguely quantified rough set," IEEE Transactions on Fuzzy Systems, vol. 30, no. 5, pp. 1396–1408, May 2022, doi: https://doi.org/10.1109/TFUZZ.2021.3058643

M. A. Sewwandi, Y. Li, and J. Zhang, "A class-specific feature selection and classification approach using neighborhood rough set and K-nearest neighbor theories," Applied Soft Computing, vol. 143, p. 110366, Aug. 2023, doi: https://doi.org/10.1016/j.asoc.2023.110366

J. Xu, M. Yuan, and Y. Ma, "Feature selection using self-information and entropy-based uncertainty measure for fuzzy neighborhood rough set," Complex & Intelligent Systems, vol. 8, no. 1, pp. 287–305, Feb. 2022, doi: https://doi.org/10.1007/s40747-021-00356-3

T. Szul, S. Tabor, and K. Pancerz, "Application of the BORUTA algorithm to input data selection for a model based on rough set theory (RST) to prediction energy consumption for building heating," Energies, vol. 14, no. 10, p. 2779, May 2021, doi: https://doi.org/10.3390/en14102779

B. Said, M. Lathamaheswari, P. K. Singh, A. A. Ouallane, A. Bakhouyi, A. Bakali, M. Talea, A. Dhital, and N. Deivanayagampillai, "An intelligent traffic control system using neutrosophic sets, rough sets, graph theory, fuzzy sets and its extended approach: a literature review," Neutrosophic Sets and Systems, vol. 50, pp. 10–26, Jun. 2022. Available: https://digitalrepository.unm.edu/nss_journal/vol50/iss1/2/

B. Halder, S. Mitra, and M. Mitra, "Classification of complete myocardial infarction using rule-based rough set method and rough set explorer system," IETE Journal of Research, vol. 68, no. 1, pp. 85–95, Jan. 2022, doi: https://doi.org/10.1080/03772063.2019.1588175

J. Xu, K. Shen, and L. Sun, "Multi-label feature selection based on fuzzy neighborhood rough sets," Complex & Intelligent Systems, vol. 8, no. 3, pp. 2105–2129, Jun. 2022, doi: https://doi.org/10.1007/s40747-021-00636-y

E. Forghani, R. Sheikh, S. M. Hosseini, and S. S. Sana, "The impact of digital marketing strategies on customer’s buying behavior in online shopping using the rough set theory," International Journal of System Assurance Engineering And Management, vol. 13, no. 2, pp. 625–640, Apr. 2022, doi: https://doi.org/10.1007/s13198-021-01315-4

C. Liu, X. Zhu, J. Zhang, M. Shen, K. Chen, X. Fu, L. Ma, X. Liu, C. Zhou, D. X. Zhou, and G. Wang, “eQTLs play critical roles in regulating gene expression and identifying key regulators in rice,” Plant Biotechnology Journal, vol. 20, no. 12, pp. 2357–2371, Dec. 2022, doi: https://doi.org/10.1111/pbi.13912

T. Wang and M. Zhou, "Integrating rough set theory with customer satisfaction to construct a novel approach for mining product design rules," Journal of Intelligent & Fuzzy Systems, vol. 41, no. 1, pp. 331–353, Aug. 2021, doi: https://doi.org/10.3233/JIFS-201829

Published

2025-05-31

Issue

Section

Articles