TY - GEN
T1 - Is Invisible XML Ready for College Students? Trying iXML and XProc on a Music Analysis Project in an Undergraduate Text Analysis Course
AU - Simons, Michael Roy
AU - Beshero-Bondar, Elisa E.
N1 - Publisher Copyright:
Copyright © 2025 by the authors.
PY - 2025
Y1 - 2025
N2 - Is invisible XML ready for teaching university undergraduates? Is it a good idea to try this? This paper will attempt to address these questions. University students in the Digital Media, Arts, and Technology program at Penn State Behrend are offered a course in “Large-Scale Text Analysis”. Going into this course, students have experience in encoding text with XML, transforming XML with XSLT, and web development with HTML and CSS. In the past, the Text Analysis course has been a procedural “Regex- and-Python course”: preparing text corpora by generating simple XML from regularly-patterned files using regular expression search-and-replace operations, using XQuery to extract the portions of the texts to analyze, and producing plain-text inputs to provide to Python. Python has dominated the experience of the pipeline. This year’s course tried a different approach. Students were taught iXML grammars as a way to prepare XML for analysis and XProc for pipelining. Regular expression matching involved working with XSLT, and the entire XML stack was used before approaching Python. Students learned how to install software in alpha stages, and they tested how well it works across platforms. From this exploratory start, one student project team found a very practical use-case for applying invisible XML in a project pipeline for analyzing chord chart musical notation. In this paper, we discuss the potential we discovered for invisible XML in music analysis. We also share our recommendations for guiding people to prepare processing pipelines that incorporate invisible XML, and we reflect on what aspects of this risky teaching experiment were most worthwhile.
AB - Is invisible XML ready for teaching university undergraduates? Is it a good idea to try this? This paper will attempt to address these questions. University students in the Digital Media, Arts, and Technology program at Penn State Behrend are offered a course in “Large-Scale Text Analysis”. Going into this course, students have experience in encoding text with XML, transforming XML with XSLT, and web development with HTML and CSS. In the past, the Text Analysis course has been a procedural “Regex- and-Python course”: preparing text corpora by generating simple XML from regularly-patterned files using regular expression search-and-replace operations, using XQuery to extract the portions of the texts to analyze, and producing plain-text inputs to provide to Python. Python has dominated the experience of the pipeline. This year’s course tried a different approach. Students were taught iXML grammars as a way to prepare XML for analysis and XProc for pipelining. Regular expression matching involved working with XSLT, and the entire XML stack was used before approaching Python. Students learned how to install software in alpha stages, and they tested how well it works across platforms. From this exploratory start, one student project team found a very practical use-case for applying invisible XML in a project pipeline for analyzing chord chart musical notation. In this paper, we discuss the potential we discovered for invisible XML in music analysis. We also share our recommendations for guiding people to prepare processing pipelines that incorporate invisible XML, and we reflect on what aspects of this risky teaching experiment were most worthwhile.
UR - https://www.scopus.com/pages/publications/105017843528
UR - https://www.scopus.com/pages/publications/105017843528#tab=citedBy
U2 - 10.4242/BalisageVol30.Beshero-Bondar01
DO - 10.4242/BalisageVol30.Beshero-Bondar01
M3 - Conference contribution
AN - SCOPUS:105017843528
T3 - Balisage Series on Markup Technologies
BT - Proceedings of Balisage
PB - Mulberry Tecnologies, Inc.
T2 - Balisage: The Markup Conference 2025
Y2 - 4 August 2025 through 8 August 2025
ER -