403 as response #283
| Just double-checking that I understand the project's assumptions correctly. Is this only a matter of the site's response, or does Scrapegraph-ai additionally restrict access based on e.g. robots.txt? |
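One way to separate the two possibilities is to check the site's robots.txt yourself, independently of the library. The sketch below uses only Python's standard `urllib.robotparser`; it says nothing about what Scrapegraph-ai does internally. The helper name `is_allowed`, the user agent `my-bot`, and the sample rules are made up for illustration.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the robots.txt content as an iterable of lines
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Hypothetical rules, only for demonstration
EXAMPLE_ROBOTS = """\
User-agent: *
Disallow: /private/
"""

if __name__ == "__main__":
    print(is_allowed(EXAMPLE_ROBOTS, "my-bot", "https://example.com/public/page"))
    print(is_allowed(EXAMPLE_ROBOTS, "my-bot", "https://example.com/private/page"))
```

If robots.txt allows the URL and the request still comes back as 403, the block is more likely coming from the site's response itself (for example bot detection) than from a robots.txt rule.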
Answered by PeriniM May 22, 2024
Replies: 2 comments 5 replies
| I think we can overcome the problem with robots.txt. Please write the code. |
| We suggest using this proxy service for proxy rotation: https://dashboard.statproxies.com/?refferal=scrapegraph |
Hey @fx71, try setting the headless flag to False and you will be able to fetch the HTML. This sometimes happens with JavaScript-heavy websites.
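For reference, a minimal sketch of what that could look like, assuming the `headless` key sits at the top level of the graph config and that `SmartScraperGraph` is the graph in use. The helper names, the API key placeholder, and the model name are assumptions, not taken from this thread; adjust them to your setup and library version.

```python
def build_graph_config(headless: bool = False) -> dict:
    """Build a graph config dict; headless=False opens a visible browser window."""
    return {
        "llm": {
            "api_key": "YOUR_API_KEY",  # placeholder, replace with your own key
            "model": "gpt-3.5-turbo",   # assumption: use whichever model you have configured
        },
        "headless": headless,  # the flag suggested in the answer above
        "verbose": True,
    }


def fetch_with_visible_browser(url: str, prompt: str):
    """Hypothetical helper: run a scrape with the browser in non-headless mode."""
    # Imported lazily so the config helper works without scrapegraphai installed
    from scrapegraphai.graphs import SmartScraperGraph

    graph = SmartScraperGraph(
        prompt=prompt,
        source=url,
        config=build_graph_config(headless=False),
    )
    return graph.run()
```

Non-headless mode needs a display, so on a server without one this may require a virtual display or a different approach.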