How do I scrape a list of domains from a site?


SUBMITTED BY: Guest

DATE: June 15, 2019, 5:29 p.m.

FORMAT: Text only

SIZE: 2.8 kB

HITS: 409

  1. How do I scrape a list of domains from a site?
  2. I am trying to compile a list of all of the domains that are on a site.
  3. ++++++++++++++++++++++++++++
  4. list of top cheapest hosting http://listtop.pw/
  5. Top 200 best traffic exchange sites http://listtop.pw/surf
  6. list of top gpt sites
  7. list of top ptc sites
  8. list of top ptp sites
  9. list of top crypto currency Wallets sites
  10. Listtop.pw
  11. Listtop.pw
  12. Listtop.pw
  13. +++++++++++++++++++++++++++++
  14. For example, I want to find all of the domains that are on Youtube.
  15. Would the search operation be site
  16. Youtube.com ".com" + ".net" + ".org" etc?
  17. Can that operator be tweaked so it includes all domains such as .io etc without having to mention it?
  18. Depending on what software you are using to scrape, you could use a regex for that.
  19. For example: /(?:[a-z0-9](?:[a-z0-9-]{0,61}[a-z0-9])?\.)+[a-z0-9][a-z0-9-]{0,61}[a-z0-9]/g
  20. That would match the domain names. Example of that regex:
  21. Code:
  22. https://regexr.com/3au3g
  23. It will work on .io domains too.
  24. Edit: If you are talking about finding in google search, I do not think it will work. Not even the solution you are trying will work correctly.
  25. Depending on what software you are using to scrape, you could use a regex for that.
  26. For example: /(?:[a-z0-9](?:[a-z0-9-]{0,61}[a-z0-9])?\.)+[a-z0-9][a-z0-9-]{0,61}[a-z0-9]/g
  27. That would match the domain names. Example of that regex:
  28. Code:
  29. https://regexr.com/3au3g
  30. It will work on .io domains too.
  31. Edit: If you are talking about finding in google search, I do not think it will work. Not even the solution you are trying will work correctly.
  32. Click to expand...
  33. What would be the best way to do so? Let's say the site is a directory. May or may not be that well indexed in Google.
  34. Home Page:https://rebrand.ly/Only-KEYWORD-Research-Needed
  35. STEP 1 :
  36. Plug the website on Ahrefs ( You can do trial = 7 USD for 7 days)
  37. STEP 2 : Click Linked Domain
  38. STEP 3: Result in less than 4 seconds
  39. What would be the best way to do so? Let's say the site is a directory. May or may not be that well indexed in Google.
  40. Topiano's idea is awesome to be frank. This will be the cheapest ( and the least time consuming) way to find the domains.
  41. Otherwise, I would simply make a python domain crawler.
  42. Python+ selenium can work perfectly.
  43. If the links are clickable you can use href property to get all the domains. If non clickable then get text from the body section and use regex to get all the links and use another regex to get domains only
  44. centexhosting.com
  45. hosting xs
  46. make money uploading videos
  47. w hostel boracay contact number
  48. hosting u bosni
  49. o domaine des delices marrakech
  50. netsukses.com
  51. sunbtc.space
  52. make money zaful
  53. undostreschocolate.wixsite.com
  54. olymptrade.com
  55. 2 host tick
  56. make money kindle publishing
  57. idomains reviews

comments powered by Disqus