Add links and format

Dan Schaper 2016-10-30 06:44:10 -07:00
parent 7f3a956c4a
commit fb17a68b5e

@ -1,4 +1,4 @@
# Pi-hole's Default Block Lists
## Pi-hole's Default Block Lists
By default, when `pihole -g` pulls in lists of domains to block, we combine several lists, which are defined in [`/etc/pihole/adlists.default`](https://github.com/pi-hole/pi-hole/blob/master/adlists.default):
Note: There are several lists that are commented out. In order to enable them, follow the instructions at the top of the file. After making any changes, run `pihole -g` to pull in any changes.
@ -6,10 +6,10 @@ Note: There are several lists that are commented out. In order to enable them, f
If you add any domains, then they can go anywhere in the file, so long as they are not commented out (prefixed with `#`)
```text
## Pi-hole ad-list default sources. Updated 22/05/2016 #########################
## Pi-hole ad-list default sources. Updated 29/10/2016 #########################
# #
# To make changes to this file: #
# 1. run `cp /etc/pihole/adlists.default /etc/pihole/adlists.list` #
# 1. run `cp /etc/pihole/adlists.default /etc/pihole/adlists.list` #
# 2. run `nano /etc/pihole/adlists.list` #
# 3. Uncomment or comment any of the below lists #
# #
@ -22,7 +22,6 @@ If you add any domains, then they can go anywhere in the file, so long as they a
https://raw.githubusercontent.com/StevenBlack/hosts/master/hosts
# Other lists we consider safe:
http://adblock.gjtech.net/?format=unix-hosts
http://mirror1.malwaredomains.com/files/justdomains
http://sysctl.org/cameleon/hosts
https://zeustracker.abuse.ch/blocklist.php?download=domainblocklist
@ -30,7 +29,7 @@ https://s3.amazonaws.com/lists.disconnect.me/simple_tracking.txt
https://s3.amazonaws.com/lists.disconnect.me/simple_ad.txt
# hosts-file.net list. Updated frequently, but has been known to block legitimate sites.
http://hosts-file.net/ad_servers.txt
https://hosts-file.net/ad_servers.txt
# Mahakala list. Has been known to block legitimage domains including the entire .com range.
# Warning: Due to the sheer size of this list, the web admin console will be unresponsive.
@ -40,7 +39,7 @@ http://hosts-file.net/ad_servers.txt
#http://optimate.dl.sourceforge.net/project/adzhosts/HOSTS.txt
# Windows 10 telemetry list
#https://raw.githubusercontent.com/crazy-max/WindowsSpyBlocker/master/hosts/windows10_spy.txt
#https://raw.githubusercontent.com/crazy-max/WindowsSpyBlocker/master/data/hosts/win10/spy.txt
# Securemecca.com list - Also blocks "adult" sites (pornography/gambling etc)
#http://securemecca.com/Downloads/hosts.txt
@ -51,20 +50,24 @@ https://raw.githubusercontent.com/quidsup/notrack/master/trackers.txt
# Block the BBC News website Breaking News banner
#https://raw.githubusercontent.com/BreakingTheNews/BreakingTheNews.github.io/master/hosts
# Untested Lists:
#https://raw.githubusercontent.com/reek/anti-adblock-killer/master/anti-adblock-killer-filters.txt
#http://spam404bl.com/spam404scamlist.txt
#https://raw.githubusercontent.com/Dawsey21/Lists/master/main-blacklist.txt
#http://malwaredomains.lehigh.edu/files/domains.txt
# Following two lists should be used simultaneously: (readme https://github.com/notracking/hosts-blocklists/)
#https://raw.github.com/notracking/hosts-blocklists/master/hostnames.txt
#https://raw.github.com/notracking/hosts-blocklists/master/domains.txt
# Combination of serveral host files on the internet (warning some facebook domains are also blocked but you can go to facebook.com). See https://github.com/mat1th/Dns-add-block for more information.
#https://raw.githubusercontent.com/mat1th/Dns-add-block/master/hosts
```
### Block More Than Advertisements
## Block More Than Advertisements
By using alternate lists, you have the ability to block tracking sites, malware domains, known spam servers, and more.
We've included many of these lists in `adlists.default`, but they are commented out. In order to use them, copy `adlists.default` to `adlists.list` and uncomment them.
## These Lists Will Need Additional Parsing Logic
### These Lists Will Need Additional Parsing Logic
The lists below are not in standard hosts format. Since Pi-hole blocks ads at the DNS level, **just** the domain name needs to be extracted from the lists. To do this, you will likely need to use `sed` and `awk` to parse down to get just the domain names.
* `http://jansal.googlecode.com/svn/trunk/adblock/hosts`
* `http://www.sa-blacklist.stearns.org/sa-blacklist/sa-blacklist.current`
@ -87,7 +90,7 @@ Image you found a list you want to use, but it is formatted with a bunch of extr
||linkz.it^
```
You can use `sed` and/or `awk` (or other commands) to remove the extra characters to get just the domain name. It helps to be familiar with scripting, but if you wanted to parse down the list above, you could do something like this:
You can use [`sed`](https://linux.die.net/man/1/sed) and/or [`awk`](https://linux.die.net/man/1/awk) (or other commands) to remove the extra characters to get just the domain name. It helps to be familiar with scripting, but if you wanted to parse down the list above, you could do something like this:
```
curl -s http://some.list | sed 's/^||//'
```
@ -99,7 +102,7 @@ cracksplay.com^
fbgamecheatz.info^
linkz.it^
```
Then, you could use `sed` again, or even something like `cut`. Since the domains won't have a carat in the name, you can use it as a delimiter with the `cut` command to display only the domain name.
Then, you could use `sed` again, or even something like [`cut`](https://linux.die.net/man/1/cut). Since the domains won't have a carat in the name, you can use it as a delimiter with the `cut` command to display only the domain name.
```
curl -s http://some.list | sed 's/^||//' | cut -d'^' -f-1
```