Data science cat and dog

Andrew Russell Green

Research, data science and software portfolio

Data science cat and dog

Andrew Russell Green

Research, data science and software portfolio

Contents

Welcome to my portfolio! I'm a mixed-methods researcher, data scientist and software developer. Here are some projects I've worked on. Thanks so much for stopping by.

P.S. You can also check out my blog here.

Wikimedia Fundraising

An analysis of donations to the Wikimedia Foundation based on publicly available data. By adjusting donation amounts for local inflation, I added information about the social processes behind the data. …→

Trend analysis
Data visualization
Data preparation
Python
Research methodology
Wikimedia Fundraising

Wikidata Metrics

An exploratory research project to define metrics about the use of Wikidata (structured data) on Wikipedia and elsewhere. We studied data sources and volunteer practices, created a conceptual framework, developed code and visualizations, and made product recommendations. …→

Metrics design
Product recommendations
Conceptual frameworks
Data visualization
SQL
Spark
Big data
Python
Writing
Wikidata Metrics

Wikifunctions UX

Mixed-methods, generative UX/design research to support a new Wikipedia feature: integration with a wiki of code called Wikifunctions. We interviewed 27 volunteers in 13 countries, made product recommendations, and learned about how wikis evolve. …→

UX/design research
Mixed-methods research
Research methodology
Product recommendations
Interviewing
Statistics
Data visualization
SQL
Python
Writing
Wikifunctions UX

Semantic Web, NLP and Archives

An interdisciplinary project to model archival metadata using the Semanitc Web, provide open access to cultural heritage, and incorporate natural language processing (NLP) in search. …→

Product management
Co-design
Semantic Web
Data modeling
NLP
Usability testing
Interdisciplinary research
Archival science
Java
SPARQL
Writing
Semantic Web, NLP and Archives

Ph.D.: Formal and Natural Languages

In my dissertation, I argue that formal languages (like computer code and mathematical formulas) are deeply similar to natural language. To support this view, I develop a theoretical framework and analyze a corpus of Semantic Web expressions. …→

Social Anthropology
Cognitive Linguistics
Cognitive Grammar
Semantic Web
Embodied Mathematics
Theories of culture
Naturalist Epistemology
Writing
Ph.D.: Formal and Natural Languages

Master's: Mixed-Methods Study of a Social Movement

Thesis about the core beliefs of a social movement in Mexico City, studied using a mixed-methods approach. …→

Social Anthropology
Mixed-methods research
Field work
Interviewing
Survey design
Document analysis
Discourse analysis
Writing
Master's: Mixed-Methods Study of a Social Movement

Publications

“Measuring Wikidata Usage on Other Wikis: Overview and Approach.” Wiki Workshop 2025. Wikimedia Foundation, 2025. PDF

Statement Signals: Measuring Wikidata Usage on Other Wikis. Research report. Wikimedia Deutschland, 2024. PDF

Connecting Wikifunctions to Wikipedia: Opportunities and Challenges. Research report. Wikimedia Foundation, 2024. PDF

Derrumbando la Barrera Cualitativa-Cuantitativa: Perspectivas sobre el Pensamiento, las Expresiones Formales, el Lenguaje y la Investigación Social. Ph.D. dissertation. Advisor: Ricardo Maldonado Soto. Instituto Nacional de Antropología e Historia, 2019. PDF

Tejedores de Imágenes. Propuestas metodológicas de Investigación y gestión del patrimonio fotográfico y audiovisual. Co-authors: Lourdes Roca, Felipe Morales and Carlos Hernández. Mexico City: Instituto Mora, 2014. Info
Winner of the 2014 Antonio García Cubas award for best textbook.

“Huellas de luz. Reflexiones metodológicas sobre investigación con imágenes y patrimonio”, in Imatge i Recerca: Jornades Antoni Varés (12es, 2012). Co-author: Lourdes Roca. Girona: Ayuntament de Girona, 2012. PDF

“Modular, Best-Practice Solutions for a Semantic Web-Based Digital Library Application”, in Ontologies: Reasoning and Modularity. Proceedings of the Workshop on Ontologies: Reasoning and Modularity (WORM-08). Co-author: José Antonio Villarreal Martínez. Stattler, U. and Tamilin, A., eds. In the series CEUR Workshop Proceedings. Vol. 348. Aachen: University of Aachen, 2008. PDF

“Search, Natural Language Generation and Record Display Configuration: Research Directions Stemming From a Digital Library Application Development Experience”, in Proceedings of the Workshop on Semantic Search (SemSearch 2008) at the 5th European Semantic Web Conference. Co-author: José Antonio Villarreal Martínez. Bloehdorn, S. et al., eds. In the series CEUR Workshop Proceedings. Vol. 334. Aachen: University of Aachen, 2008. PDF

“Metadatos transformados. Archivos digitales, la Web Semántica y el nuevo paradigma de la catalogación”, in Memorias de las V Jornadas sobre Imagen, Cultura y Tecnología. Madrid: Universidad Carlos III, 2006. PDF

“Rescate de la memoria”, in Ciencia y Desarrollo, Sept. 2006. Mexico: Consejo Nacional de Ciencia y Tecnología. PDF

“Proyecto de investigación para la creación de una Fototeca Digital y un Sistema de Información para Archivos Fotográficos”, in Anais do Museo Paulista. Historia e Cultura, no. 1, vol. 13, January-June 2005. Co-authors: Fernando Aguayo and Lourdes Roca. São Paulo: Museu Paulista. PDF

“Cambios actuales en el esquema de supuestos básicos de las prácticas catalográficas”. In Aldea Global no. 1 (formerly Metadata), 2003. PDF

“Durito, un proyecto de difusión y análisis digital con base en el software libre”, in Memorias del Primer Seminario Internacional “Los Archivos Sonoros y Visuales en América Latina”. Rodríguez Reséndiz, P. O., ed. Mexico City: Radio Educación, 2002. PDF