Periodic Reporting for period 3 - UnStruct (Structural Models for Text and other Unstructured Data)
Reporting period: 2022-11-01 to 2024-04-30
Another important example is the measurement of remote work offerings from an enormous corpus of job postings from Lightcast. The shift to remote work since the pandemic has been one of the largest disruptions to the labor market since WWII. Survey responses take us some way in understanding the incidence of remote work, but limited sample sizes make it difficult to report, for example, remote work levels and growth rates for individual cities and firms. My coauthors and I use a large language model along with a set of human-labeled job postings to build a custom model that maps the text of any job ad into a classification of whether or not it offers remote work. We then apply it to hundreds of millions of job ads and document extensive heterogeneity in adoption. Data is available for download at https://wfhmap.com/ and can again be used to inform numerous crucial economic questions.
Other important contributions to date document how inflation uncertainty, as measured from text, leads monetary policymakers to adopt more aggressive policy stances, and how to measure whether social connections between individuals generate correlated outcomes. The project has also produced a summary of text-as-data methods for economists and example code for implementing a host of modern algorithms.