Skip to content

Commit

Permalink
continue if no text available; Achi bible has text in browser but not…
Browse files Browse the repository at this point in the history
… from crawler (or curl)
  • Loading branch information
jimregan authored and brawer committed Dec 15, 2017
1 parent 09f24f1 commit d12e53c
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions Lib/corpuscrawler/util.py
Original file line number Diff line number Diff line change
Expand Up @@ -651,6 +651,8 @@ def crawl_bibleis(crawler, out, bible):
if doc.status != 200:
continue
html = doc.content.decode('utf-8')
if '<p>No text available for the selected Bible.</p>' in html:
continue
if 'var audioUrl = ' in html:
audio = html.split('var audioUrl = "')[1].split('"')[0]
inner = html.split('<div id="chapter-content"')[1].split('<div class="content-text">')[1].split('<hr>')[0]
Expand Down

0 comments on commit d12e53c

Please sign in to comment.