Skip to content

Instantly share code, notes, and snippets.

View ldko's full-sized avatar

Lauren Ko ldko

  • UNT Libraries
  • Denton, TX
View GitHub Profile
@ldko
ldko / heritrix_from_source.md
Last active December 5, 2024 21:26
Building Heritrix from main branch for use with ExtractorYoutubeDL

Install maven and java with coordinating versions that work with your system and Heritrix (see https://github.com/internetarchive/heritrix3/blob/master/.github/workflows/maven.yml):

sudo dnf install maven-openjdk17

Building latest Heritrix (in this example, I put heritrix distributions at /usr/local/ and have a symlink named h3 that points to the one I currently use, i.e. /usr/local/h3 is a symlink pointing to /usr/local/heritrix-3.6.1-SNAPSHOT-20241205)

cd /tmp
git clone https://github.com/internetarchive/heritrix3.git
cd heritrix3