All Questions
Tagged with python-2.7 beautifulsoup
1,252 questions
2 votes, 1 answer, 285 views
Web Scraping with Beautifulsoup python - cannot scrap all results
I tried to scrape data from this site:
https://www.nobroker.in/property/sale/chennai/Adyar?searchParam=...
-1 votes, 1 answer, 83 views
BeautifulSoup4 doesn't read ≈ as an HTML entity
This seems familiar; why does ≈ not get picked up by html.parser?
>>> from bs4 import BeautifulSoup
>>> for html in ['hey ‘ 3','hey π','hey ≈ 3'...
1 vote, 1 answer, 638 views
Python BeautifulSoup getting KeyError for image src
I am trying to get the product image URL from this page. I am getting the image src, but I am also getting this error: KeyError: 'data-lazy-src'
Here is my code:
image = soup.select('img.attachment-shop_single')
...
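A KeyError like the one quoted above typically appears when a tag is indexed with `tag['attr']` and the attribute is missing on some of the matched elements. The following is a minimal sketch, not the asker's code: the HTML snippet and the `image_url` helper are hypothetical, and it assumes some images carry `data-lazy-src` while others only have `src`.

```python
from bs4 import BeautifulSoup

# Hypothetical markup: the first image is lazy-loaded, the second is not.
html = """
<img class="attachment-shop_single" data-lazy-src="https://example.com/a.jpg" src="placeholder.gif">
<img class="attachment-shop_single" src="https://example.com/b.jpg">
"""

soup = BeautifulSoup(html, "html.parser")


def image_url(img):
    # Tag.get returns None for a missing attribute instead of raising KeyError,
    # so we can fall back from data-lazy-src to the plain src attribute.
    return img.get("data-lazy-src") or img.get("src")


urls = [image_url(img) for img in soup.select("img.attachment-shop_single")]
print(urls)
```

Using `.get()` keeps the loop running over every matched image, whereas `img['data-lazy-src']` stops at the first image that lacks the attribute.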
-1 votes, 1 answer, 65 views
NameError BeautifulSoup HTML parse
I wrote code intended to search for and remove div tags with a specific class from multiple .html files in a local directory (recursively). Python 2.7
#!/usr/bin/python
from bs4 import ...
0 votes, 1 answer, 97 views
POST Request Python Web Scraping: Get URLs from Tag
Hi, I am new to both Python and web scraping. In my script, I want to send a POST request and then get the URLs from the response. But I have a problem: I can't get the URLs from the a tag. When I inspect the website ...
0 votes, 2 answers, 244 views
How to get all products from a beautifulsoup page
I want to get all the products on this page:
nike.com.br/snkrs#estoque
My Python code is this:
produtos = []
def aviso():
print("Started!") ...
0 votes, 3 answers, 217 views
How to get text inside source code with beautifulsoup
I'm trying to scrape this page:
https://www.nike.com.br/air-max-pre-day-153-169-211-330676
If you look at the source code, and look for the term "Tamanho" (with quotes) you ...
1 vote, 0 answers, 68 views
How to scrape a webpage that uses JavaScript to build the HTML? [duplicate]
I have a scraper written in Python 2 using requests and BeautifulSoup. We do not want to use Selenium. The website we are scraping has now changed one of its pages. The page is now building the elements on ...
0 votes, 0 answers, 80 views
Loop while dynamically scraping - Python
I want to make real-time scrapes that run at separate intervals. For example, if the last data I scrape is T = 1, then it will loop once every 6 hours; if T = 2, then it will loop once every hour; and T = 3 ...
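One way to read the scheme described in this question is as a lookup from the last scraped value of T to a sleep interval. The sketch below is only an illustration under that reading: the names `INTERVALS_SECONDS`, `next_interval`, and `run` are hypothetical, only the T = 1 and T = 2 intervals come from the excerpt, and the fallback interval for any other value is an assumption because the excerpt is cut off.

```python
import time

# Intervals for T = 1 and T = 2 are taken from the question text.
INTERVALS_SECONDS = {1: 6 * 60 * 60, 2: 60 * 60}
# Assumption: the excerpt is truncated, so any other T falls back to one hour.
DEFAULT_INTERVAL_SECONDS = 60 * 60


def next_interval(t):
    """Return how long to wait before the next scrape, given the last T."""
    return INTERVALS_SECONDS.get(t, DEFAULT_INTERVAL_SECONDS)


def run(scrape, max_runs, sleep=time.sleep):
    """Call scrape() max_runs times, sleeping for an interval chosen from each result."""
    waits = []
    for _ in range(max_runs):
        t = scrape()              # scrape() is expected to return the latest T
        wait = next_interval(t)
        waits.append(wait)
        sleep(wait)               # injectable so the loop can be exercised without waiting
    return waits
```

Passing `sleep` as a parameter is a design choice for testability; in a real scraper the default `time.sleep` would be used and `scrape` would wrap the requests and BeautifulSoup calls.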
0 votes, 1 answer, 27 views
Need a better way to find the tag to use in beautifulsoup
This is my code, which saves the heading of www.nytimes.com to a .txt file.
Using requests and beautifulsoup
import requests
from bs4 import BeautifulSoup
url = requests.get("https://www.nytimes....
0 votes, 2 answers, 418 views
Beautiful Soup and requests problem it doesn't show any text output
I am using Beautiful Soup and requests to print the full text of the article on this website:
https://www.vanityfair.com/style/society/2014/06/monica-lewinsky-humiliation-culture
This is my code:
import ...
1 vote, 0 answers, 1k views
BeautifulSoup - "Couldn't find a tree builder" error, what am I missing?
Upon running my Python script, I am getting the following error:
File "myfile.py", line 538, in parse_format_views_into_memsource_json
page_soup = BeautifulSoup(page, 'xml')
File "C:\...
0 votes, 1 answer, 88 views
BS4 (BeautifulSoap) - AttributeError: 'NoneType' object has no attribute 'getText'
I've been researching here, and the solutions I found did not solve my problem.
I'm trying to build a scraper for an Amazon page that captures the product's name and price, but as I try to ...
2 votes, 1 answer, 569 views
How to select all elements based on alternative attributes? [BeautifulSoup]
I have to select all HTML tags that either have a class attribute whose class name is in a list OR have a style attribute with the specified value.
I tried to solve my problem step by step and I got this:
...
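The "class in a list OR a specific style value" condition described in this question can be expressed by passing a function to `find_all`, which Beautiful Soup calls once per tag. This is a sketch under assumptions, not the asker's solution: the markup, `WANTED_CLASSES`, `WANTED_STYLE`, and the `matches` helper are all hypothetical.

```python
from bs4 import BeautifulSoup

# Hypothetical markup and criteria, for illustration only.
html = """
<div class="promo">a</div>
<p style="display:none">b</p>
<span class="other">c</span>
<em>d</em>
"""
WANTED_CLASSES = {"promo", "banner"}
WANTED_STYLE = "display:none"


def matches(tag):
    # With html.parser, class is a multi-valued attribute and comes back as a list;
    # tags without a class attribute return None, hence the "or []".
    classes = tag.get("class") or []
    has_wanted_class = bool(WANTED_CLASSES.intersection(classes))
    has_wanted_style = tag.get("style") == WANTED_STYLE
    return has_wanted_class or has_wanted_style


soup = BeautifulSoup(html, "html.parser")
selected = soup.find_all(matches)
names = [tag.name for tag in selected]
print(names)
```

The style comparison here is an exact string match; real pages often vary whitespace inside `style`, so a normalized or substring comparison may be needed.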
2 votes, 1 answer, 125 views
Beautiful Soup Scraping
I'm having issues with old, previously working code that no longer functions correctly.
My Python code scrapes a website using Beautiful Soup and extracts event data (date, event, link).
My code is pulling ...