My Projects<!-- --> | <!-- -->Ben Pettis
Ben Pettis

My Projects

Birdwatch Archive

web

October 18, 2023

The old Twitter bird logo placed inside a drawer alongside a magnifying glass.

Community Notes (formerly named “Birdwatch”) is Twitter’s crowdsourced fact-checking program to combat mis- and dis-information. By signing up to be a Birdwatch contributor, a user can add contextual notes and commentary to other tweets as well as rate the contributions of others. User submissions to the Community Notes program also serve as metacommentary on the platform more generally. Beyond their fact-checking role, Birdwatch notes also illuminate how some users perceived Elon Musk’s recent purchase of the platform and how the subsequent changes aligned with their own understandings of what the platform ought to be. This project describes my attempt to preserve and parse the data that Twitter publishes from the crowd-sourced factchecking program. I have developed a web app which regularly downloads the TSV files that Twitter posts on its website, imports them into a Postgres database, and then displays these online for searching and filtering.

Read More...

Twitter Archive

other

July 6, 2023

Screenshot of the twitter-archive.benpettis.com website. It is a simple page, with black text on a white background and the title 'Welcome to the @ben_pettis_ Twitter archive'

I archived my old Twitter data, and then hosted the resulting files in a Google Cloud Storage bucket. The cost works out to be just a few cents every month, and I can now provide permalinks to all my old Twitter content——now hosted entirely on my own domain. If you're interested in doing something similar with your own data, I've put together an overview of what I did.

Read More...

Warner Bros. 100th Anniversary

web

April 4, 2023

A screenshot of a website displaying two rows of images. Each image is the cover of a Warner Bros pressbook.

2023 is the centennial anniversary for the Warner Bros. studio. The Media History Digital Library (MHDL) has hundreds of high-resolution scans of Warner Bros. pressbooks—promotional publications that the studio produced with recommendations for theater owners on how best to promote each film. As part of my ongoing work for the MHDL, I produced a dynamic online visualization of all these pressbook covers.

Read More...

WCFTR Website

web

January 13, 2023

A screenshot of the homepage of the WCFTR website. There is a black horizontal menu bar, a paragraph of text, and a red box with a photograph of a reading room.

During the Fall of 2022, I worked on a redesign of the website for the Wisconsin Center for Film and Theater Research (WCFTR). The previous WCFTR website used Drupal as its content management system. In addition to the design being somewhat dated, Drupal was significantly limiting the functionality and usability of the site. I assisted with migrating that old website's content to WordPress. We are using an adapted version of the University of Wisconsin-Madison's branded WordPress theme, more clearly designating the WCFTR as part of the campus community. Additionally, the new WordPress website integrates more effectively with the department's other websites and has helped WCFTR staff to be able to more easily add new content.

Read More...

Self-Hosted Mastodon Instance

other

December 20, 2022

A Photoshopped image of a cartoon mastodon sitting at a set of computer monitors. There is a caption which reads 'I HAVE NO IDEA WHAT I'M DOING'

I first joined Mastodon in mid-2022 when there were initial whispers of Elon Musk wanting to purchase Twitter. I still kept using Twitter, but created an account on mastodon.social to start learning about the platform. After Musk's official takeover, I started using Mastodon much more frequently and by the end of November I wasn't using Twitter at all anymore. Given that I was now pretty committed to participating in the fediverse, I decided to take the plunge and try to self-host my own Mastodon instance on my own server hardware.

Read More...

Muskless Twitter

web

November 15, 2022

A screenshot of the musklesstwitter.com website. There is a simplified graphic of a smartphone at the center and the heading 'Twitter now with 1000% less Musk!' Beneath this there is a fake social media newsfeed

In late 2022, some rich guy called Elon Musk bought Twitter and almost immediately began running it into the ground. This prompted many people (myself included) to leave the platform and seek out alternative social media platforms. I created this super basic webpage to give us what we all really want - the ability to type something in a box and feel like it goes somewhere—the digital equivalent of screaming into the void. All we can hope is that the void doesn't scream back at us this time.

Read More...

Thinking About Media!

web

November 1, 2022

A screenshot of a website with white text on a dark grey background. The text reads: "Trust me, bro": A (re)imagination of Thomas the Tank Engine Since 2020 it has become increasingly apparent that Barack Obama has had an outspoken influence on digital native. But this relationship is not limited to modern media contexts. As early as 1850 early instances of radicalization could be seen emerging with an orientation toward the future. In this paper, I argue that Geocities shows that scholars must return to Geocities. To do this, the paper draws upon media industries frameworks to inform its use of Critical Technocultural Discourse Analysis to critically examine Thomas the Tank Engine.

Inspired by @matthew_paul's #TeamRhetoric Twitter Bot, the website uses a JavaScript library called Tracery, originally written by Kate Compton. Using a series of template phrases and mad-libs style replacements, the site generates paper topics and abstracts fitting for any media studies scholar!

Read More...

Media History Digital Library Website

web

August 31, 2022

A screenshot of the homepage of the MHDL website. There MHDL logo is in the center of a grid of several colorful magazine covers.

During the Summer of 2022, I have been working with Eric Hoyt to redesign and improve the functionality of the Media History Digital Library website. The previous version of the website had an outdated design, and used almost entirely static HTML and CSS, which required significant manual work to keep up to date. This new version is built with PHP and is able to automatically query our Solr instance to display up-to-date content and information.

Read More...

HTML Search and Record

other

May 20, 2022

A cartoon image of an open file folder with the Google reCAPTCHA logo appearing out of it

This is a Chrome extension that detects the presence of reCAPTCHAs on a Web page and invites the user to record and preserve their interaction. By detecting specified HTML elements within a Web page, the extension enables researchers to preserve users’ interactions with an interface without needing to continuously (and invasively) record their browsing. The extension aims to balance the priorities of Web preservation with user privacy and autonomy. This represents a new approach to Web preservation that may be useful to other digital humanities projects by attending to ephemeral user interactions that other preservation tools are not as well-suited for.

Read More...

Tracking the HTTP 451 Error Code

web

May 15, 2022

A cartoon image of a server icon with a flame and cloud in the background

This project automatically tracks use of the HTTP 451 (Not Available for Legal Reasons) code, how frequently it's used, and archiving error pages. Geoblocking is a reality of the modern Web, and the code could be appropriate to use in some of these situations. We know that gov censorship is a reality in many places, but despite all the restrictions on the full flow of content online, this specific HTTP response code is actually not used all that frequently!

Read More...

Lantern 2.0

web

May 1, 2022

The Lantern logo. It is a cerulean blue color. There is a mining-style lantern on the left, and the word 'lantern' in curvy lowercase letters beside it.

During the Summer and Fall of 2021, I have been working with Eric Hoyt to upgrade and the backend and redesign the interface for Lantern, the search platform for the Media History Digital Library. This has been a highly involved project, and has required me to essentially recreate the entire website with a newer version of Ruby on Rails to use a more recent version of Blacklight. We fully launched the new version in Fall 2022.

Read More...

Crash Course to Photoshop

video

March 15, 2022

A screenshot of a computer screen with a Photoshop document open. The center of the document has a black background, and there are several shapes and lines crudely drawn in. In the top right corner of the image there is a picture-in-picture view of a person sitting at a computer looking toward the camera

I produced this introductory crash course video to use in a class I was teaching as a way to provide to students as a supplementary resource. The goal was to include just enough information for them to get comfortable with the interface and begin experimenting with the tools they would need to use for the project. Additionally, I wanted to design the video so that it would be easy for students to skip back and forth to the sections they are most interested in.

Read More...

Adobe Acrobat PDF Fixer

other

January 1, 2022

A screenshot of the Adobe Acrobat interface. It is in dark mode, so the windows are a dark grey and there is a single page of text in the center

Do you have a professor who provides PDFs that look like they’ve been run through a meat grinder? Are you a professor who provides PDFs that contain multiple scanned pages on a single page? We’ve all been there – someone has generously provided us with a PDF copy of a book chapter or article. But there’s one problem – none of the text is actually searchable. And each page of the PDF actually contains two pages of scanned text. PDFs in this format are not accessible to people using screen readers, and keeping multiple scanned pages on each PDF page can make navigating the document inconvenient and cumbersome. Adobe Acrobat can make those PDFs more usable, but manually fixing your scans each time is tedious. This Acrobat Action will semi-automatically process a scanned PDF to separate each scanned page onto its own PDF page and run the entire document through OCR to create searchable and selectable text.

Read More...

Beside the Rabbit Hole Podcast

other

November 1, 2021

A photo of a yellow warning sign showing a person tripping into a hole. There is text above which reads 'beside the rabbit hole'

During one of my graduate seminars, in addition to thinking and reading about Sound Studies, we also learned some audio production techniques. As a project in the class, I produced this podcast in which I share the project I was reasearching. There was a time when I was naive and thought that I might have the time to continue making more seasons and episodes about my work. But the reality is that I have not had the time to actually go about doing this. Perhaps one day...

Read More...

Alternative Implicit Association Test

web

May 1, 2021

A screenshot of a website. It has a white background and a large header which says 'Implicit Bias Test About Crime'

This is an alternative Implicit Bias Test created as part of the Code and Power course (LIS500) in the University of Wisconsin-Madison's iSchool. Our Implicit Bias Test about Crime aims to test each individual’s biases towards race and gender through a series of questions along with a summary of results. Each question is directed at the individual’s thought process, thus giving them a score determining how biased they are towards each photo in relation to crime. Safiya Noble (2018) has argued that even in cases where people are attempting to seek accurate information, search engines can nevertheless feed their confirmation biases. We recognize there may be some limitations in our test as there is no exact “right” answer; however, it allows each individual to compare their answers with the public and to reflect on their own choices. Ultimately, the test gathers information on one’s implicit racial and gender biases within crime by using timed elements and numerical responses.

Read More...

Jazz Genius

web

May 1, 2021

A black and white photograph of Louis Armstrong mid-bite with a large forkful of spaghetti

Jazz Genius is an experimental Digital Humanities project that interrogates the conventions and trends of jazz lyrics. The website contains thousands of songs collected from Genius.com and enables users to browse this collection and explore connections. Jazz Genius also offers several tools for analyzing this collection—including topic analyses, TEI markup, and various data visualizations.

Read More...

The Tumblr Porn Ban

video

June 1, 2020

A photo of Ben Pettis standing in front of a blank wall with a confused look on his face. There is photo over the image which reads 'Female-Presenting Nipples'

I spent about a year of my life researching and writing the thesis for my M.A. in communication studies. In the interest of making my research a bit more accessible, I wanted to explain my thesis in a shorter form. In the thesis, I examine the social network site (SNS) Tumblr and the controversy that surrounded its recently amended community guidelines and adult content policy. Tumblr had previously had somewhat of an 'alternative' identity as compared to mainstream SNSs such as Facebook or Twitter. This identity had largely resulted from its previously lax policy toward pornography and other adult content. Such content had previously been allowed on the website, which enabled a wide degree of personal freedom and expression.

Read More...

Public Speaking - Online Review Videos

video

May 1, 2020

A black screen with title text on it. There is white text on a green rectangle which says 'Parts of an Introduction' and white text on the black background which says 'SPCM 200 - Final Exam Review'

During the Spring 2020 semester, I taught multiple sections of SPCM 200 - Intro to Public Speaking. Each section had approximately 20 students, and the course covered basic principles of public speakig, along with fundamentals of research, writing, and argumentation. In March 2020, Colorado State University—like many others—transitioned to fully remote instruction in response to the COVID-19 pandemic. We transitioned SPCM 200 to an asynchronous format, and began delivering course materials primairly through pre-recorded videos. At the end of the semester, I produced many short videos such as these to help review various parts of course content before the final exam.

Read More...

The Mall

other

January 1, 2020

A brown background with several white rectangles with small drawings on them scattered throughout. The white rectangles are connected to each other with small orange lines, like a web

For some reason, when I was in elementary school I became obsessed with creating and drawing imaginary storefronts and the various items that each of them sold. The drawing skills are sub-par at best, and the actual spatial organization of the mall lacks any logic or structure whatsoever. But it was the third grade. Sue me. Years later, while visiting my parents and cleaning out some of my old stuff I rediscovered a folder full of these drawings. Now armed with the technical know-how (aka the confidence to Google and poke around with basic JavaScript), I set out to scan these old drawings and finally connect all these stores with one another like younger-me had always envisioned.

Read More...

4chan Scraper

other

January 1, 2019

A screenshot of the 4chan /pol/ politically incorrect imageboard.

This simple Python script uses 4chan's read-only APIs to scrape the information from the front page of a given imageboard. In addition to saving every image posted to the board, the script will also generate multiple CSV files that record which threads were on the front page at a given time. A folder is generated for each thread's images, as well as an individual CSV file that records each reply in the thread as well. I have done some research on anonymous online communities, the ways they communicate with one another, and how they're able to influence real events in the physical world. Rather than manually browsing and downloading content from 4chan imageboards, I built this script to automatically scrape the most recent content from a given 4chan imageboard.

Read More...

.org Doesn't Mean Credible

web

January 1, 2018

A screenshot of a website. In the screenshot there is a shooting star with the words 'the more your know' in its tail.

Many textbooks, online research guides, and other resources claim that a .org domain name is an indicator of credibility. The common explanation is that only non-profits, professional associations, and other organizations are able to register a .org domain name. One of the course objectives for SPCM 200 (Public Speaking) at CSU was to develop research skills, including practice evaluating the credibility of web sources. To that end, I wanted to teach students that there are rarely hard and fast rules to immediately assess a website – such as looking at the URL. Instead, they should expect to think critically about the web page and its content. To demonstrate this point, I purchased a domain name and created this simple website..

Read More...

Trumpster Fire Twitter Bot

web

January 1, 2018

A photograph of a large dumpster with flames coming out of it. The flames have been edited to appear as the face of President Donald Trump

I wrote a simple Twitter bot to represent anything tweeted by the President in a somewhat different format. Running on a simple Virtual Machine, the bot checks Trump’s Twitter feed every 10 minutes for new posts. Whenever it detects a new tweet, it pulls the text and overlays it atop an animated GIF of a dumpster fire using a python script and the Python Image Library (PIL).

Read More...