TXT to JSON

Parse Screaming Frog exports into structured JSON — auto-detects HTML source for best accuracy

📝
Drop Screaming Frog page files here
or browse files — .html (best) or .txt accepted
Select files to add to queue
Select a client to browse page files from Screaming Frog crawls (HTML or TXT).
No files queued. Select a client and upload .html or .txt files to start.

How it works

Upload Screaming Frog exports for the client site or any competitor. HTML source files give perfect accuracy; TXT files use smart heuristics enriched with heading CSVs when available.

  • HTML source — perfect H1-H6, meta tags, images
  • TXT + CSV enrichment — exact H1/H2 from CSVs
  • TXT heuristic — fallback when no CSVs available
  • From Storage — auto-detects best format
  • Download All — ZIP with client + competitor folders
  • Navigation & form noise stripped
  • Flat, ordered content blocks

Storage paths

Source page_text files from Screaming Frog crawl exports:

Source:
screaming-frog-exports/{client}/{domain}/{crawl_run}/

Output:
txt-to-json/{client}/{date}/{run}/

Competitor:
txt-to-json/{client}/{date}/{run}/competitors/{domain}/