Projects per year
Personal profile
Research interests
[Note: This profile is incomplete, especially with regard to my publications. See http://shomir.net for many more.]
My research brings together natural language processing (NLP), privacy, and artificial intelligence.
I am interested in solving problems to enable computers to do meaningful work with large volumes of natural language text. My lab develops new methods for NLP and applies them to a variety of domains, including privacy, online social networks, web science, and digital libraries. I am particularly interested in breaking down technology's "walls of text", i.e., situations where a human user or decision-maker is expected to consume a large quantity of text to take action while lacking sufficient resources (time, expertise) to properly understand what they have been given. I have applied this paradigm to privacy policies, scholarly manuscripts, documents from the world wide web, and historical texts, and I am always interested in new domains to work with.
Personal profile
I am an Assistant Professor in the College of Information Sciences and Technology at Penn State, where I lead the Human Language Technologies Lab. I am also a Faculty Affiliate of Penn State's Institute for CyberScience and a member of the Social Data Analytics graduate faculty.
From 2016 until 2018 I was an Assistant Professor in the EECS Department at the University of Cincinnati. Prior to that I was a postdoc and a lecturer in Carnegie Mellon University's School of Computer Science and an NSF International Research Fellow in the University of Edinburgh's School of Informatics. I received my PhD in Computer Science from the University of Maryland in 2011.
Expertise related to UN Sustainable Development Goals
In 2015, UN member states agreed to 17 global Sustainable Development Goals (SDGs) to end poverty, protect the planet and ensure prosperity for all. This person’s work contributes towards the following SDG(s):
Education/Academic qualification
Computer Science, PhD, University of Maryland
Award Date: May 1 2011
Computer Science, M.S., University of Maryland
Award Date: May 1 2008
Computer Science, B.S., Virginia Tech
Award Date: May 1 2005
Mathematics, B.S, Virginia Tech
Award Date: May 1 2005
Philosophy, B.A., Virginia Tech
Award Date: May 1 2005
Researcher Defined Keywords
- natural language processing
- computational linguistics
- privacy
- artificial intelligence
Fingerprint
- 1 Similar Profiles
Collaborations and top research areas from the last five years
Projects
- 3 Active
-
CAREER: Large-Scale Exploration and Interpretation of Consumer-Oriented Legal Documents
8/1/23 → 7/31/28
Project: Research project
-
SaTC: CORE: Small: Toward Privacy Equity through Contextual Understanding of Self-Disclosure
6/1/23 → 5/31/26
Project: Research project
-
SaTC: CORE: Medium: Collaborative: Automatically Answering People's Privacy Questions
7/15/19 → 12/31/23
Project: Research project
-
Creation and Analysis of a Corpus of Scam Emails Targeting Universities
Ciambrone, G. & Wilson, S., Apr 30 2023, ACM Web Conference 2023 - Companion of the World Wide Web Conference, WWW 2023. Association for Computing Machinery, Inc, p. 24-27 4 p. (ACM Web Conference 2023 - Companion of the World Wide Web Conference, WWW 2023).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
-
Nationality Bias in Text Generation
Venkit, P. N., Gautam, S., Panchanadikar, R., Huang, T. H. & Wilson, S., 2023, EACL 2023 - 17th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference. Association for Computational Linguistics (ACL), p. 116-122 7 p. (EACL 2023 - 17th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
2 Scopus citations -
A Study of Implicit Language Model Bias Against People With Disabilities
Venkit, P. N., Srinath, M. & Wilson, S., 2022, In: Proceedings - International Conference on Computational Linguistics, COLING. 29, 1, p. 1324-1332 9 p.Research output: Contribution to journal › Conference article › peer-review
9 Scopus citations -
A Tale of Two Regulatory Regimes: Creation and Analysis of a Bilingual Privacy Policy Corpus
Arora, S., Hosseini, H., Utz, C., Kumar, V. B., Dhellemmes, T., Ravichander, A., Story, P., Mangat, J., Chen, R., Degeling, M., Norton, T., Hupperich, T., Wilson, S. & Sadeh, N., 2022, 2022 Language Resources and Evaluation Conference, LREC 2022. Calzolari, N., Bechet, F., Blache, P., Choukri, K., Cieri, C., Declerck, T., Goggi, S., Isahara, H., Maegaard, B., Mariani, J., Mazo, H., Odijk, J. & Piperidis, S. (eds.). European Language Resources Association (ELRA), p. 5460-5472 13 p. (2022 Language Resources and Evaluation Conference, LREC 2022).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
1 Scopus citations -
Automated Detection of Doxing on Twitter
Karimi, Y., Squicciarini, A. & Wilson, S., Nov 11 2022, In: Proceedings of the ACM on Human-Computer Interaction. 6, CSCW2, 276.Research output: Contribution to journal › Article › peer-review
2 Scopus citations