Newest 'python+javascript+scrapy' Questions

0 votes

0 answers

19 views

Is there a way to mimic the Element.closest() function from JavaScript in Scrapy (Python)?

I am trying to convert a web scraper I built in JavaScript with the Puppeteer library into a Python-based web scraper running on Scrapy. I want to be able to do something similar to JavaScript's ...

Christopher Cho

1

asked May 8 at 20:24

1 vote

1 answer

72 views

Can't scrape a webpage whose contents are dynamically generated through JavaScript

I am trying to scrape table data from a webpage, but it's not a normal webpage that can be scraped using its HTML tags and CSS class or ID. The contents of the webpage are dynamically generated using ...

Abhinay

11

asked Mar 14 at 18:05

0 votes

0 answers

50 views

Why is Scrapy-splash not returning expected HTML from dynamic javascript page?

I'm attempting to scrape the Market table data from the following page utilizing scrapy-splash: "manta.layerbank.finance/bank" (Put in quotes because might be causing spam issue?) So far I'm ...

Kody F

1

asked Jan 24 at 0:16

0 votes

2 answers

52 views

CSS Notation for a Scrapy Spider Script

I wrote the below python script to return the item name, price, and link for items listed on https://shop.doverstreetmarket.com/collections/shops-noah import scrapy class DSMUKSpider(scrapy.Spider): ...

Teron

23

asked Dec 19, 2023 at 23:03

0 votes

1 answer

364 views

How to scrape location data from a leaflet map?

I want to access the location (latitude, longitude) of the water level sensor markers found on this website, but I can't find any HTML tags that contain their locations. Any guidance would be very ...

Msh. Niyaz

33

asked Aug 15, 2023 at 10:52

0 votes

0 answers

62 views

Using Scrapy with Selenium together

I was trying to integrate Selenium into my Scrapy project. I had a middleware set up for Selenium Chrome as such. It now works fine: it loads the page and collects the data needed. However, I couldn't figure a ...

Low LiFe

15

asked Aug 5, 2023 at 14:14

-1 votes

1 answer

84 views

How can I loop in unlimited scroll sites to extract every page?

I don't want to use an API to extract the data; I just want to learn this approach for the project. The element for the next page is not visible and the website has unlimited scroll. I have scraped the first page but ...

Anish Thapa

1

asked Jul 3, 2023 at 17:01

0 votes

0 answers

84 views

Playwright does not return anything?

import scrapy from scrapy_playwright.page import PageMethod class PositionsSpider(scrapy.Spider): name = "positions" allowed_domains = ["https://trafigura.com/"] ...

Anish Thapa

1

asked Jun 28, 2023 at 16:33

0 votes

0 answers

174 views

PyCharm JavaScript heap out of memory on Ubuntu

Extracting data from multiple URLs using scrapy-playwright leads to the following error after parsing about 1000 URLs: "FATAL ERROR: Reached heap limit Allocation failed - JavaScript heap out of memory" I ...

Michael

367

asked Jan 26, 2023 at 20:00

1 vote

0 answers

919 views

Button not clicking with scrapy-playwright

I am attempting to click on an SSO login for a platform by testing its button functionality with scrapy-playwright. I have entered an incorrect email, so after clicking the button, it should throw ...

Dollar Tune-bill

367

asked Dec 10, 2022 at 11:14

1 vote

0 answers

43 views

Is there any way of forcing a JavaScript GET request before rendering HTML with Splash?

I'm getting stuck loading dynamic content from this website with Splash: https://www.fravega.com/l/tv-y-video/tv/?categorias=tv-y-video%2Ftv&page=1 I'm trying to get the href attribute from ...

Drupman

11

asked Sep 1, 2022 at 23:02

0 votes

0 answers

92 views

Data Scraping: Integrate Scrapy [Python library to build spiders] with puppeteer- or playwright-extra [JS headless browser automation]

I am currently developing multiple scrapers that should be maintained for the next couple of years. My typical approach to traverse large pages is to use Scrapy, a well-maintained Python framework to ...

Rondo Bohrens

27

asked Aug 16, 2022 at 9:29

0 votes

1 answer

73 views

Is it possible to extract the download syllabus link with requests or Scrapy without Selenium?

I am trying to extract the download syllabus link from this website: https://www.simplilearn.com/big-data-and-analytics/python-for-data-science-training The link is not available in the page source, and I ...

nikhil kumar

61

asked Jul 17, 2022 at 4:43

-2 votes

1 answer

36 views

I want to fetch the details from the Payload tab of the developer tools

I want to fetch the details of the URLs, which differ from the ones present in the anchor tag. Accessing the href link directs to the previous page rather than the next page. How do I fetch the following view ...

Reema

11

asked Jul 13, 2022 at 16:15

1 vote

0 answers

55 views

Support with Scrapy: get data from a double postback

I am looking for help with a specific problem: getting data from a postback table. I need to access a table that is loaded after pressing a button with a JavaScript PostbackWithOption. I think I am ...

crianopa

77

asked Jun 27, 2022 at 7:17
