How do I scrape a list of domains from a site?
I am trying to compile a list of all of the domains that are on a site.
++++++++++++++++++++++++++++
list of top cheapest hosting http://listtop.pw/
Top 200 best traffic exchange sites http://listtop.pw/surf
list of top gpt sites
list of top ptc sites
list of top ptp sites
list of top crypto currency Wallets sites
Listtop.pw
Listtop.pw
Listtop.pw
+++++++++++++++++++++++++++++
For example, I want to find all of the domains that are on Youtube.
Would the search operation be site
Youtube.com ".com" + ".net" + ".org" etc?
Can that operator be tweaked so it includes all domains such as .io etc without having to mention it?
Depending on what software you are using to scrape, you could use a regex for that.
For example: /(?:[a-z0-9](?:[a-z0-9-]{0,61}[a-z0-9])?\.)+[a-z0-9][a-z0-9-]{0,61}[a-z0-9]/g
That would match the domain names. Example of that regex:
Code:
https://regexr.com/3au3g
It will work on .io domains too.
Edit: If you are talking about finding in google search, I do not think it will work. Not even the solution you are trying will work correctly.
Depending on what software you are using to scrape, you could use a regex for that.
For example: /(?:[a-z0-9](?:[a-z0-9-]{0,61}[a-z0-9])?\.)+[a-z0-9][a-z0-9-]{0,61}[a-z0-9]/g
That would match the domain names. Example of that regex:
Code:
https://regexr.com/3au3g
It will work on .io domains too.
Edit: If you are talking about finding in google search, I do not think it will work. Not even the solution you are trying will work correctly.
Click to expand...
What would be the best way to do so? Let's say the site is a directory. May or may not be that well indexed in Google.
Home Page:https://rebrand.ly/Only-KEYWORD-Research-Needed
STEP 1 :
Plug the website on Ahrefs ( You can do trial = 7 USD for 7 days)
STEP 2 : Click Linked Domain
STEP 3: Result in less than 4 seconds
What would be the best way to do so? Let's say the site is a directory. May or may not be that well indexed in Google.
Topiano's idea is awesome to be frank. This will be the cheapest ( and the least time consuming) way to find the domains.
Otherwise, I would simply make a python domain crawler.
Python+ selenium can work perfectly.
If the links are clickable you can use href property to get all the domains. If non clickable then get text from the body section and use regex to get all the links and use another regex to get domains only
centexhosting.com
hosting xs
make money uploading videos
w hostel boracay contact number
hosting u bosni
o domaine des delices marrakech
netsukses.com
sunbtc.space
make money zaful
undostreschocolate.wixsite.com
olymptrade.com
2 host tick
make money kindle publishing
idomains reviews