The projects below are a subset of the digital humanities work currently taking place at UC Berkeley. This collection is part of the 2020 Digital Humanities Fair -- fully online during the week of April 13-16.

A Collaborative Approach for GIS Historical Maps Metadata Project

In the changing landscape of digital research and open access, the roles of technical services librarians are expected to be more creative than traditional cataloging and simple metadata creation. One of the possible new roles is supporting digital humanities research by organizing, managing, and providing access to data sets via metadata creation and management.

Read more

Ancient World Computational Analysis (AWCA)

AWCA began 20 years ago as a digitization project. Since 2019 the analytical workflow has been built at the D-Lab by myself and a team of DS-Discovery students. The goal is to build a citation network from any collection of PDFs. The project data mines a large collection of sources from the disciplines of ancient Near Eastern Studies, Classics, Archaeology, and Middle Eastern Languages.

Read more

Do Presidents Keep Their Promises?

Many presidential hopefuls see the best way to win over voters as promising to tackle a prominent issue of their day. With President Obama, it was healthcare; with President George W. Bush, it was tax-cuts. It is often wondered whether these issues that they speak about during pre-election debates will actually be what they focus on most while in office.

Read more

Explore Who Is the True Author of the Dream of the Red Chamber (A Famous Chinese novel) & the Character Relationships

My project is about the Dream of the Red Chamber which is a great novel, maybe the greatest one in the long history of China for thousands of years. The book is also shrouded in mystery. No one knows who wrote the last 40 chapters. Some believe that it was written by Cao Xueqin, who is also the author of the previous chapters, while others believe that the last 40 chapters was written by another author, whose name is Gaoe. Not only do I want to find out the true author of the last 40 chapters, but I also want to analyze the characters’ relationships.

Read more

Historic Places in Merced

This project explores community-based research in the context of Digital Preservation. A group of graduate students from UC Merced is developing an active collaboration with two local groups within the San Joaquin’s valley - the Courthouse Museum of Merced (Merced Historical Society) and the Downtown Neighborhood Association (DNA) - in order to explore the Built Heritage of Merced in relation to its communities, the formation of identities in San Joaquin’s Valley California and the development of the town of Merced.

Read more

Measuring Hate Speech by Integrating Ordinal, Multitask Deep Learning with Faceted Rasch Modeling

Outcome phenomena are typically measured at the binary level: a comment is toxic or not, an MRI scan shows cancer or is clear, a patient is diagnosed as having a disease or not. But underlying that dichotomization there is often a continuous spectrum or latent variable. Physical quantities such as temperature and weight can be measured as interval variables where magnitudes are meaningful. How can we achieve that same interval measurement for arbitrary outcomes - creating continuous scales with magnitudes?

Read more

Public Humanities Design Studio:  A New Digital Humanities Lab at UC Merced

The Public Humanities Design Studio is UC Merced's new public and digital humanities collaborative space. This poster will highlight the process of setting up a new digital humanities lab and will touch on some of the challenges that arose from the recent COVID-19 pandemic.

Read more

Sentiment Analysis and Discovering Linguistic Relationships in Financial News Articles

Financial news articles are a challenge to analyze because of the unique style and vocabulary utilized in the domain. Many terms are hyperspecific and may not be known to casual readers while being commonplace to an experienced reader, i.e. 'alpha-generating', 'floor', 'priced in', 'resistance', etc. Financial sentiment analysis is a niche field, plagued with many challenges. Most of the data is unlabeled, and general language models don’t work well on financial articles because of the unique style and vocabulary in said articles.

Read more

Sumerian Networks: Reconstructing Ur III

The goal of the Sumerian Network project has been to build reproducible socio-economic networks from the Ur III textual archives. We applied novel computational methods for name disambiguation, and in the absence of 'gold-standard' data, we built an atomized network based on name instances in textual attestations for c. 15,000 documents.

Read more

Text Analysis: Writings of Early European Explorers and Naturalists

In 1492, Christopher Columbus became the first European to set foot in the Americas, ushering in an era of European colonialism, exploration, and exploitation across the globe. The Americas were thought of as the “New World”, a uncharted continent rich with resources and land open for the taking. This thought would be echoed again in the 19th century with European colonization of Africa, famously called the “Scramble for Africa”, when European powers divided up the continent and claimed pieces for themselves.

Read more

The Book of the Dead in 3D

Techniques of digital capture and 3D visualizations, in particular photogrammetry, are currently widely used in Egyptology in order to create replicas of objects, as well as reconstruct archaeological sites, tombs, and temples. Since 2015, the "Book of the Dead in 3D" project has implemented 3D visualizations of ancient Egyptian coffins at the Phoebe A. Hearst Museum and other local institutions. The goal of the project is to analyse the texts on the coffins in order to create interactive annotations on the models themselves.

Read more

The Warren Court - Language Modeling & Text Analysis

The Warren Court acted as one of the most influential periods in Supreme Court history. It incorporated much of the Bill of Rights, strengthened the rights of the accused, established the right to privacy, and, most notably, declared segregation unconstitutional in Brown v. Board of Education. Using the Jupyter Notebooks from Data 88: Language Modeling and Text Analysis, I constructed multi-dimensional language models from unstructured textual data.

Read more

Using an Adaptive Learning Algorithm to Encourage Physical Activity in Low-Income Ethnic Minority Patients with Depression and Diabetes

Introduction: Depression and diabetes are highly disabling diseases with high comorbidity. Though typically treated separately, they share a common risk factor: low physical activity. Mobile apps might be effective self-management tools to help patients increase physical activity. However, most digital interventions do not tailor their content to individual users, which might impact their effectiveness.

Read more

Visualizing Ancient Egyptian landscapes and material culture: Cultural contexts for immersive visualization and VR

The goals of the this project are to develop and integrate applications of visualization technologies in the fields of archaeology, epigraphy, history of art and history of ancient Egypt, into immersive visualizations and VR/AR applications that allow scholars, students, and the interested public to navigate from the large landscape, to the monument, and to the magically inscribed sarcophagus in its ancient Egyptian context.

Read more

What’s the Difference between Science Fictions?

According to the definition of Wikipedia, Science Fiction (Sci-fi) is a speculative fiction with imaginative and futuristic concepts including advanced science and technology, space exploration, time travel, parallel universes, and extraterrestrial life. Sci-fi also explores the potential consequences of scientific, social, and technological innovations. Science fiction can be traced back to ancient times, when the precursors of Sci-fi were related to fantasy, horror, and superhero mythologies.

Read more