Skip to content

Instantly share code, notes, and snippets.

@DejfCold
Created December 14, 2021 03:50
Show Gist options
  • Select an option

  • Save DejfCold/4894bbb2ae5ae4c68aac13016ea1fee5 to your computer and use it in GitHub Desktop.

Select an option

Save DejfCold/4894bbb2ae5ae4c68aac13016ea1fee5 to your computer and use it in GitHub Desktop.
Web Archive a "Dezinformátoři" na koronavirus.mzcr.cz
import requests
# https://twitter.com/maestrosill/status/1470496141884235793
r = requests.get('http://web.archive.org/cdx/search/cdx?url=koronavirus.mzcr.cz/&fl=timestamp&collapse=digest')
timestamps = r.text.splitlines()
lenght = len(timestamps)
for idx, timestamp in enumerate(timestamps):
print(f'{idx + 1} / {lenght}')
url = f'http://web.archive.org/web/{timestamp}/https://koronavirus.mzcr.cz/'
r = requests.get(url)
content = r.text
if 'Dezinformátoři' in content:
print(f'Dezinformatori jsou na {url}')
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment