Australian Internet Data Downloads
This page explains how to download licensed (paid) data from the Wallabyup databases.
Download files are zip-compressed (and password-protected) and are regenerated every Saturday morning.
Scroll down to how to buy data.
There are 4 downloadable files (tab-separated CSV format):
- all URLs (no body content),
- domains (each domain/subdomain has a row in the database),
- outlinks (sites linking to other 3rd-party sites),
- invalid pages.
All URLs Download $810
A download of all URLs in the Wallabyup main database. File sample (not live): sample-all_urls.txt (when importing, use a tab (\t) separator, not a comma).
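For example, a minimal sketch (using Python's csv module on the sample file named above) that reads the file with a tab delimiter:
import csv

# a minimal sketch: read the tab-separated file with Python's csv module
with open('sample-all_urls.txt', newline='') as f:
    reader = csv.reader(f, delimiter='\t')  # tab, not comma
    for row in reader:
        print(row)  # one list of column values per line
        break  # just show the first row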
Rows: 70,591,008.
File size (zipped): 6 GB. File size (raw): 17 GB.
Download file: myindex-2024-07-27.zip (unlock password with payment).
The file is a tab-separated CSV file with the 11 columns below:
- id, scheme, url, crawled, nextCrawl, runTime, domain, au, fromy, done, oDone
Columns Explained:
id
scheme*
url (without scheme)
crawled (when WallabyupBot crawled the page)
nextCrawl (date the bot will next crawl the page)
runTime (seconds to crawl page)
domain** (domain and domain extensions)
au (ignore)
fromy (referrer)
done (how many times page has been hit over the years)
oDone (outlinks done or not)
* The "scheme" column is the URL scheme which has 4 possible options (see download sample file):
1) [empty] = URL has no www and no SSL: "http://"
2) w = URL has www: "http://"
3) s = URL has SSL: "https://"
4) s,w = URL has both www and SSL: "https://"
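As a minimal sketch (assuming the url column stores the address without a leading "www."), a full URL can be rebuilt from the scheme and url columns like this:
def full_url(scheme, url):
    # "s" means SSL, "w" means www (see the 4 options above)
    prefix = 'https://' if 's' in scheme else 'http://'
    host = ('www.' + url) if 'w' in scheme else url
    return prefix + host

print(full_url('s,w', 'wallabyup.au'))  # https://www.wallabyup.au
print(full_url('', 'example.au'))       # http://example.au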
** The "domain" column is the domain name exploded/split with an underscore delimiter which allows for fulltext searching. Wallabyup.au would be "wallabyup_au,au" while the abc.net.au would be "abc_net_au,net_au,au". See download sample file.
Note: URLs with double quotes (") or commas (,) are classed as invalid by WallabyupBot and are excluded from indexing, meaning no database "quote/comma" import errors.
To loop through the downloaded file (in Python), extracting the rows with the domain name you want:
fileName = '/downloads/myindex.txt'
fileContents = open(fileName)
line_count = 0
for line in fileContents:
    # process the file line by line
    line = line.strip()  # remove the line break
    columns = line.split("\t")  # split the line on the tab delimiter
    domain_column = columns[6]  # e.g. "wallabyup_au,au"
    domain_extensions = domain_column.split(',')  # "wallabyup_au,au" is now a list
    # check the "domain" column contains "wallabyup_au"
    if 'wallabyup_au' in domain_extensions:
        print(line)  # show the full line
    line_count += 1
fileContents.close()
print('Line count: ' + str(line_count))
Domains Download $675
A download of all domains and subdomains.
Rows: 873,724.
File size (zipped): 55 MB. File size (raw): 260 MB.
Download file: Domains-kulled_3_clms-2024-07-27.zip (unlock password with payment).
The file is a tab-separated CSV file with the columns below:
- id, host, domain, ipCrawled, runTime, countryCode, country, isp, ipRangeQueryDate, fromy, found, spam, robotsCrawled, robotsTxt, statDone, siteHomeBl, tldBl, sitePages, tldPages, penaltyNext, penaltyNote.
Columns Explained:
id
host (e.g. "wallabyup.au")
domain (see "All URLs D/L" explanation above)
ipCrawled (date when WallabyupBot got the IP)
runTime (time it took to get the IP address)
countryCode (2 letter country code)
country (name of country)
isp (name of the ISP)
ipRangeQueryDate (date when a whois lookup occurred)
fromy (referrer)
found (date site was found)
spam (spam rating)
robotsCrawled (date last time robots.txt file was crawled)
robotsTxt (reduced copy of robots.txt)
statDone (date backlinks stats were done)
siteHomeBl (count of backlinks to the home page)
tldBl (top level domain backlinks)
sitePages (how many pages on the site... not the whole top level domain)
tldPages (how many pages on the top level domain)
penaltyNext (date when the penalty flag is next tested)
penaltyNote (what the penalty was for, e.g. slow server, 404s, etc.)
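For example, a minimal sketch (assuming the column order listed above, with countryCode as the 6th column) that tallies domains per country:
from collections import Counter

country_counts = Counter()
with open('Domains.txt') as fileContents:
    for line in fileContents:
        columns = line.strip().split("\t")
        country_counts[columns[5]] += 1  # countryCode (2 letter code)
print(country_counts.most_common(10))  # top 10 countries by domain count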
To loop through the downloaded file (in Python), extracting the rows with the host name you want:
fileName = 'Domains.txt'
fileContents = open(fileName)
rowCount = 0
for line in fileContents:
    rowCount = rowCount + 1
    line = line.strip()  # remove the line break
    columns = line.split("\t")  # split the line on the tab delimiter
    host_column = columns[1]  # e.g. "wallabyup.au"
    # check host = wallabyup.au
    if host_column == 'wallabyup.au':
        print(line)  # show the full line
fileContents.close()
print('rowCount: ' + str(rowCount))
Outlinks Download $960
A download of all outlinks (sites linking out to other sites); from the linked site's perspective these are backlinks.
Rows: 22,431,046.
File size (zipped): 2,433 MB. File size (raw): 6,527 MB.
Download file: outlinks-2024-07-27.zip (unlock password with payment).
The file is a tab-separated CSV file with the columns below:
- id, host, url, added, bulkPoints, domain, outlinkScheme, outlinkUrl, homePage, outlinkDomain, outlinkTld, anchorText, spam, occupationId, linkJuice, follow.
Columns Explained:
id
host (e.g. "wallabyup.au")
url (URL without the scheme)
added (date added to index)
bulkPoints (score used for social weighting and other factors)
domain (see "All URLs D/L" explanation above)
outlinkScheme (for below... see "All URLs D/L" explanation above)
outlinkUrl (outlinked URL without the scheme)
homePage (home page or not: "1" yes or "0" no)
outlinkDomain (same as domain column)
outlinkTld (top level domain)
anchorText (what the anchor text was)
spam (penalty points for spam)
occupationId (empty column... ignore)
linkJuice (empty column... ignore)
follow (if outlink was a normal link then "1" (follow) or if a nofollow tag was used then "0" (do not follow))
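For example, a minimal sketch (assuming the column order listed above, with host as the 2nd column and follow as the last) that counts follow vs nofollow outlinks for one host:
follow_count = 0
nofollow_count = 0
with open('outlinks.txt') as fileContents:
    for line in fileContents:
        columns = line.strip().split("\t")
        if columns[1] != 'wallabyup.au':  # host column
            continue
        if columns[15] == '1':  # follow column: "1" = follow
            follow_count += 1
        else:  # "0" = nofollow
            nofollow_count += 1
print('follow:', follow_count, 'nofollow:', nofollow_count)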
To loop through the downloaded file (in Python), extracting the rows with the host name you want:
fileName = 'outlinks.txt'
fileContents = open(fileName)
line_count = 0
for line in fileContents:
    line = line.strip()  # remove the line break
    columns = line.split("\t")  # split the line on the tab delimiter
    host_column = columns[1]  # e.g. "wallabyup.au"
    # check host = wallabyup.au
    if host_column == 'wallabyup.au':
        print(line)  # show the full line
    line_count += 1
fileContents.close()
print('Line count: ' + str(line_count))
Invalid Download $260
A download of invalid pages crawled.
Rows: 20,929,247.
File size (zipped): 1,363 MB. File size (raw): 6,488 MB.
Download file: Nopes-2024-07-27.zip (unlock password with payment).
The file is a tab-separated CSV file with the columns below:
- id, host, url, crawled, runTime, domain, fromy, nope, atFault, type, redirectDest.
Columns Explained:
id
host (e.g. "wallabyup.au")
url (URL with the scheme)
crawled (date when WallabyupBot hit the page)
runTime (how long the scrape took)
domain (see "All URLs D/L" explanation above)
fromy (referring page / backlink)
nope (what the error message was, e.g. 404)
atFault (if the site owner is the cause of the error)
type (inlink, outlink, or recrawl)
redirectDest (the URL the page is going to)
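For example, a minimal sketch (assuming the column order listed above, with nope as the 8th column) that tallies error types for one host:
from collections import Counter

errors = Counter()
with open('Nopes.txt') as fileContents:
    for line in fileContents:
        columns = line.strip().split("\t")
        if columns[1] == 'wallabyup.au':  # host column
            errors[columns[7]] += 1  # nope column (e.g. 404)
print(errors.most_common())  # most frequent error types first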
To loop through the downloaded file (in Python), extracting the rows with the host name you want:
fileName = 'Nopes.txt'
fileContents = open(fileName)
line_count = 0
for line in fileContents:
    line = line.strip()  # remove the line break
    columns = line.split("\t")  # split the line on the tab delimiter
    host_column = columns[1]  # e.g. "wallabyup.au"
    # check host = wallabyup.au (there might not be any invalid pages)
    if host_column == 'wallabyup.au':
        print(line)  # show the full line
    line_count += 1
fileContents.close()
print('Line count: ' + str(line_count))
How To Buy Data
1) Send payment via PayID (through your banking app/portal) to my PayID* (see my email below) and include in the description a unique identifier like your first name or email address. PayID is more secure as it is a push payment (unlike credit cards).
* pay at daniellyons.net
2) Use the contact form at DanielLyons.net and say which download file you want.
3) I will reply to your email with a password to unlock the zip file.
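To unlock the archive in Python, a minimal sketch (assuming the zip uses standard ZipCrypto encryption; AES-encrypted zips would need a third-party library such as pyzipper, and the password below is a placeholder):
import zipfile

with zipfile.ZipFile('myindex-2024-07-27.zip') as archive:
    # pwd must be bytes; replace with the password from my email
    archive.extractall(pwd=b'password-from-email')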
You can check that Wallabyup.au and DanielLyons.net know each other with a Wal site profile lookup showing both sites link to each other (reciprocal links).
Refunds have a 1-month delay from when you supply your details (this prevents money muling).
Licence
The data is copyrighted under the following licence conditions:
- You can not offer more than 2% of each data download per week to individual 3rd parties; in other words, you can't just bulk-sell the data yourself for 50% less than what Wallabyup charges (or for free).
- You can copy (see limit in previous point) and redistribute the material in any medium or format for any purpose, even commercially.
- You must attribute Wallabyup as the copyright holder and the supplier of the data (including by providing a prominent URL link to Wallabyup.au).
- You can not transfer a data download or licence to others (no sublicensing).
- Wallabyup can revoke the licence for any reason. An example of licence termination is when the data is used for subversive reasons (hacking or other crimes).
- The licence lasts forever for the downloaded data.
- Derivative works do not need to be distributed under the same licence as long as the above conditions are first met (including not offering more than 2%; see the 1st point).