Unveiling Privacy Policy Complexity: An Exploratory Study Using Graph Mining, Machine Learning, and Natural Language Processing

Research output: Contribution to book or proceedingConference articlepeer-review

Abstract

Privacy policy documents are often lengthy, complex, and difficult for non-expert users to interpret, leading to a lack of transparency regarding the collection, processing, and sharing of personal data. As concerns over online privacy grow, it is essential to develop automated tools capable of analyzing privacy policies and identifying potential risks. In this study, we explore the potential of interactive graph visualizations to enhance user understanding of privacy policies by representing policy terms as structured graph models. This approach makes complex relationships more accessible and enables users to make informed decisions about their personal data (RQ1). We also employ graph mining algorithms to identify key themes, such as User Activity and Device Information, using dimensionality reduction techniques like t-SNE and PCA to assess clustering effectiveness. Our findings reveal that graph-based clustering improves policy content interpretability. It highlights patterns in user tracking and data sharing, which supports forensic investigations and identifies regulatory non-compliance. This research advances AI-driven tools for auditing privacy policies by integrating interactive visualizations with graph mining. Enhanced transparency fosters accountability and trust.

Original languageEnglish
Title of host publication2025 6th International Conference on Artificial Intelligence, Robotics, and Control, AIRC 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages514-520
Number of pages7
ISBN (Electronic)9798331543488
ISBN (Print)9798331543488
DOIs
StatePublished - May 7 2025
Event6th International Conference on Artificial Intelligence, Robotics, and Control, AIRC 2025 - Savannah, United States
Duration: May 7 2025May 9 2025

Publication series

Name2025 6th International Conference on Artificial Intelligence, Robotics and Control (AIRC)

Conference

Conference6th International Conference on Artificial Intelligence, Robotics, and Control, AIRC 2025
Country/TerritoryUnited States
CitySavannah
Period05/7/2505/9/25

Scopus Subject Areas

  • Mechanical Engineering
  • Control and Optimization
  • Artificial Intelligence
  • Computer Science Applications

Keywords

  • Privacy Policy Document Analysis Graph Mining Dimensionality Reduction (DR) Machine Learning (ML) NLP Legal Documents AI-driven Compliance Data Transparency

Fingerprint

Dive into the research topics of 'Unveiling Privacy Policy Complexity: An Exploratory Study Using Graph Mining, Machine Learning, and Natural Language Processing'. Together they form a unique fingerprint.

Cite this