TY - GEN
T1 - Privacy Now or Never
T2 - 2023 ACM Symposium on Document Engineering, DocEng 2023
AU - Srinath, Mukund
AU - Matheson, Lee
AU - Venkit, Pranav Narayanan
AU - Zanfir-Fortuna, Gabriela
AU - Schaub, Florian
AU - Lee Giles, C.
AU - Wilson, Shomir
N1 - Publisher Copyright:
© 2023 ACM.
PY - 2023/8/22
Y1 - 2023/8/22
N2 - The General Data Protection Regulation (GDPR) and other recent privacy laws require organizations to post their privacy policies, and place specific expectations on organisations' privacy practices. Privacy policies take the form of documents written in natural language, and one of the expectations placed upon them is that they remain up to date. To investigate legal compliance with this recency requirement at a large scale, we create a novel pipeline that includes crawling, regex-based extraction, candidate date classification and date object creation to extract updated and effective dates from privacy policies written in English. We then analyze patterns in policy dates using four web crawls and find that only about 40% of privacy policies online contain a date, thereby making it difficult to assess their regulatory compliance. We also find that updates in privacy policies are temporally concentrated around passage of laws regulating digital privacy (such as the GDPR), and that more popular domains are more likely to have policy dates as well as more likely to update their policies regularly.
AB - The General Data Protection Regulation (GDPR) and other recent privacy laws require organizations to post their privacy policies, and place specific expectations on organisations' privacy practices. Privacy policies take the form of documents written in natural language, and one of the expectations placed upon them is that they remain up to date. To investigate legal compliance with this recency requirement at a large scale, we create a novel pipeline that includes crawling, regex-based extraction, candidate date classification and date object creation to extract updated and effective dates from privacy policies written in English. We then analyze patterns in policy dates using four web crawls and find that only about 40% of privacy policies online contain a date, thereby making it difficult to assess their regulatory compliance. We also find that updates in privacy policies are temporally concentrated around passage of laws regulating digital privacy (such as the GDPR), and that more popular domains are more likely to have policy dates as well as more likely to update their policies regularly.
UR - http://www.scopus.com/inward/record.url?scp=85173559623&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85173559623&partnerID=8YFLogxK
U2 - 10.1145/3573128.3609342
DO - 10.1145/3573128.3609342
M3 - Conference contribution
AN - SCOPUS:85173559623
T3 - DocEng 2023 - Proceedings of the 2023 ACM Symposium on Document Engineering
BT - DocEng 2023 - Proceedings of the 2023 ACM Symposium on Document Engineering
PB - Association for Computing Machinery, Inc
Y2 - 22 August 2023 through 25 August 2023
ER -