Will is a software developer on the corpus creation team. As the final step in the annotation pipeline, he aggregates data from disparate systems to produce machine learning language data in accessible and meaningful representations. Will is also involved in the design of data formats and storage systems. Will is interested in data-focused technologies including: relational databases, XML, and non-traditional data stores and their applications. Will takes a data-centric development philosophy paired with finding the appropriate tool sets and technologies.
When not developing, Will collaborates with coworkers to standardize efficient data production and management practices using existing and custom software and systems.