    Repositories list

    • template_project

      Public template
      Template for starting DataLab research projects.
      0000 Updated Jan 17, 2025
    • Scraping and visualizing the UC Davis Potential Worksite Exposure Reporting (AB 685) data
      Jupyter Notebook
      MIT License
      4660 Updated Jan 1, 2025
    • chronam

      Public
      R package for interfacing with Chronicling America
      R
      1250 Updated Apr 19, 2024
    • Code supporting the transfer of data from Notion to Zotero
      TeX
      1100 Updated Dec 20, 2023
    • Claire Graves' 2022 collaboration to identify access to endocrine surgery centers
      R
      GNU General Public License v3.0
      1140 Updated Nov 14, 2023
    • archv-py

      Public
      Python implementation of archv
      Python
      0121 Updated Feb 11, 2023
    • ava

      Public
      Mirror of the UC Davis Library's American Viticultural Areas Repository
      R
      Creative Commons Zero v1.0 Universal
      57000 Updated Jun 22, 2022
    • Interface to the Tesseract OCR system.
      R
      4100 Updated Apr 15, 2022
    • Repository for the Quintessence Web Project applying Topic Models and Word Embeddings to EEBO-TCP
      JavaScript
      00120 Updated Jul 2, 2021
    • All the scripts we use for analysis
      Python
      0070 Updated Mar 28, 2021
    • Lex
      0010 Updated Nov 14, 2020
    • R
      5000 Updated Apr 28, 2020
    • A basic LAMP stack environment built using Docker Compose.
      Dockerfile
      MIT License
      1.4k000 Updated Apr 28, 2020
    • ReadPDF

      Public
      Tools for working with PDF documents, currently converted to XML via a modified pdftohtml
      R
      4000 Updated Aug 20, 2019
    • Notes about how to organize the OCR and PDF document reading code
      R
      1000 Updated Aug 14, 2019
    • High-level package for reading academic journal articles and extracting their contents in a structured manner, building on the low-level and intermediate-level packages ReadPDF, Rtesseract, and GetDocElement
      R
      1000 Updated Jul 7, 2019
    • Mid-level package that uses Rtesseract and ReadPDF to get the intermediate-level elements from a document, e.g., tables, titles, sections, and text.
      R
      1000 Updated Jul 7, 2019
    • Package for reading data from tables in PDF and scanned (OCRed) documents.
      R
      1000 Updated Jun 10, 2019
    • pdftohtml

      Public
      Copy of pdftohtml code with enhancements
      C++
      6000 Updated Mar 6, 2019