From fb17a68b5ee009fa2a36a6a447af872f409d4954 Mon Sep 17 00:00:00 2001 From: Dan Schaper Date: Sun, 30 Oct 2016 06:44:10 -0700 Subject: [PATCH] Add links and format --- ....md => Customising-Sources-for-Ad-Lists.md | 27 ++++++++++--------- 1 file changed, 15 insertions(+), 12 deletions(-) rename Customising-sources-for-ad-lists.md => Customising-Sources-for-Ad-Lists.md (77%) diff --git a/Customising-sources-for-ad-lists.md b/Customising-Sources-for-Ad-Lists.md similarity index 77% rename from Customising-sources-for-ad-lists.md rename to Customising-Sources-for-Ad-Lists.md index 434488b..04a7733 100644 --- a/Customising-sources-for-ad-lists.md +++ b/Customising-Sources-for-Ad-Lists.md @@ -1,4 +1,4 @@ -# Pi-hole's Default Block Lists +## Pi-hole's Default Block Lists By default, when `pihole -g` pulls in lists of domains to block, we combine several lists, which are defined in [`/etc/pihole/adlists.default`](https://github.com/pi-hole/pi-hole/blob/master/adlists.default): Note: There are several lists that are commented out. In order to enable them, follow the instructions at the top of the file. After making any changes, run `pihole -g` to pull in any changes. @@ -6,10 +6,10 @@ Note: There are several lists that are commented out. In order to enable them, f If you add any domains, then they can go anywhere in the file, so long as they are not commented out (prefixed with `#`) ```text -## Pi-hole ad-list default sources. Updated 22/05/2016 ######################### +## Pi-hole ad-list default sources. Updated 29/10/2016 ######################### # # # To make changes to this file: # -# 1. run `cp /etc/pihole/adlists.default /etc/pihole/adlists.list` # +# 1. run `cp /etc/pihole/adlists.default /etc/pihole/adlists.list` # # 2. run `nano /etc/pihole/adlists.list` # # 3. Uncomment or comment any of the below lists # # # @@ -22,7 +22,6 @@ If you add any domains, then they can go anywhere in the file, so long as they a https://raw.githubusercontent.com/StevenBlack/hosts/master/hosts # Other lists we consider safe: -http://adblock.gjtech.net/?format=unix-hosts http://mirror1.malwaredomains.com/files/justdomains http://sysctl.org/cameleon/hosts https://zeustracker.abuse.ch/blocklist.php?download=domainblocklist @@ -30,7 +29,7 @@ https://s3.amazonaws.com/lists.disconnect.me/simple_tracking.txt https://s3.amazonaws.com/lists.disconnect.me/simple_ad.txt # hosts-file.net list. Updated frequently, but has been known to block legitimate sites. -http://hosts-file.net/ad_servers.txt +https://hosts-file.net/ad_servers.txt # Mahakala list. Has been known to block legitimage domains including the entire .com range. # Warning: Due to the sheer size of this list, the web admin console will be unresponsive. @@ -40,7 +39,7 @@ http://hosts-file.net/ad_servers.txt #http://optimate.dl.sourceforge.net/project/adzhosts/HOSTS.txt # Windows 10 telemetry list -#https://raw.githubusercontent.com/crazy-max/WindowsSpyBlocker/master/hosts/windows10_spy.txt +#https://raw.githubusercontent.com/crazy-max/WindowsSpyBlocker/master/data/hosts/win10/spy.txt # Securemecca.com list - Also blocks "adult" sites (pornography/gambling etc) #http://securemecca.com/Downloads/hosts.txt @@ -51,20 +50,24 @@ https://raw.githubusercontent.com/quidsup/notrack/master/trackers.txt # Block the BBC News website Breaking News banner #https://raw.githubusercontent.com/BreakingTheNews/BreakingTheNews.github.io/master/hosts - # Untested Lists: #https://raw.githubusercontent.com/reek/anti-adblock-killer/master/anti-adblock-killer-filters.txt -#http://spam404bl.com/spam404scamlist.txt +#https://raw.githubusercontent.com/Dawsey21/Lists/master/main-blacklist.txt #http://malwaredomains.lehigh.edu/files/domains.txt +# Following two lists should be used simultaneously: (readme https://github.com/notracking/hosts-blocklists/) +#https://raw.github.com/notracking/hosts-blocklists/master/hostnames.txt +#https://raw.github.com/notracking/hosts-blocklists/master/domains.txt +# Combination of serveral host files on the internet (warning some facebook domains are also blocked but you can go to facebook.com). See https://github.com/mat1th/Dns-add-block for more information. +#https://raw.githubusercontent.com/mat1th/Dns-add-block/master/hosts ``` -### Block More Than Advertisements +## Block More Than Advertisements By using alternate lists, you have the ability to block tracking sites, malware domains, known spam servers, and more. We've included many of these lists in `adlists.default`, but they are commented out. In order to use them, copy `adlists.default` to `adlists.list` and uncomment them. -## These Lists Will Need Additional Parsing Logic +### These Lists Will Need Additional Parsing Logic The lists below are not in standard hosts format. Since Pi-hole blocks ads at the DNS level, **just** the domain name needs to be extracted from the lists. To do this, you will likely need to use `sed` and `awk` to parse down to get just the domain names. * `http://jansal.googlecode.com/svn/trunk/adblock/hosts` * `http://www.sa-blacklist.stearns.org/sa-blacklist/sa-blacklist.current` @@ -87,7 +90,7 @@ Image you found a list you want to use, but it is formatted with a bunch of extr ||linkz.it^ ``` -You can use `sed` and/or `awk` (or other commands) to remove the extra characters to get just the domain name. It helps to be familiar with scripting, but if you wanted to parse down the list above, you could do something like this: +You can use [`sed`](https://linux.die.net/man/1/sed) and/or [`awk`](https://linux.die.net/man/1/awk) (or other commands) to remove the extra characters to get just the domain name. It helps to be familiar with scripting, but if you wanted to parse down the list above, you could do something like this: ``` curl -s http://some.list | sed 's/^||//' ``` @@ -99,7 +102,7 @@ cracksplay.com^ fbgamecheatz.info^ linkz.it^ ``` -Then, you could use `sed` again, or even something like `cut`. Since the domains won't have a carat in the name, you can use it as a delimiter with the `cut` command to display only the domain name. +Then, you could use `sed` again, or even something like [`cut`](https://linux.die.net/man/1/cut). Since the domains won't have a carat in the name, you can use it as a delimiter with the `cut` command to display only the domain name. ``` curl -s http://some.list | sed 's/^||//' | cut -d'^' -f-1 ```