Pi-hole shouldn't block super common things like 'google.com'
Actual Behaviour:
pi-hole blocked google.com because gravity and I had to white list it. I find that since the v5 update, I have added more whitelisted domains than I did in the past year with previous versions leading up to v5.
Seems like you have a lot of adlists that are not well maintained and include a lot of false positives (like Google). Solution is to get rid of shitty adlists.
That doesn't really make sense; 'shitty' adlists that aren't maintained would have been throwing positives/blocks of common things before the upgrade to v5 wouldn't they?
But sure, I'm game to refine my lists and get more up to date stuff. Any particularly bad offenders you see in the lists I'm using? Can you recommend any that would be considered 'good/current/maintained'?
If you include only lists, that are "Lists in green and bulleted with a tick are least likely to interfere with browsing", you should be relative save.
No. Prior to V5, the entire list was rejected. Starting in V5 we parse as much as possible and show how many domains were rejected in the output of pihole -g. You are likely using adblock or easy list formatted lists - these cause exactly this problem. Eliminate those lists and rebuild gravity.
Pi-hole doesn't block or allow anything on it's own. It blocks what you ask it to block via adlists. If you use adlists that include common domains like google.com on it, well, it will do what you ask and block google.com.
As the others have noted, don't use lists that include google.com as a domain to block.
I had used the easylist/adblock quite a while ago, but pulled it out because that one has been unsupported for pi-hole for quite a long time.
Here's my current gravity update output. There are definitely a couple that can come out for failing to find a list, but overall seems like a pretty normal refresh of gravity. The 'DNS service is not running' message irks me but I have a different help post open for that. A number of these are legitimately older lists and I am not sure when they were last maintained, see earlier post about updating some of them. Time to purge everything and start from scratch, I guess. Woo (/sarcasm)
[i] Neutrino emissions detected...
[✓] Pulling blocklist source list into range
[✓] Preparing new gravity database
[i] Target: https://raw.githubusercontent.com/StevenBlack/hosts/master/hosts
[✓] Status: Retrieval successful
[i] Received 57718 domains
[i] Target: https://mirror1.malwaredomains.com/files/justdomains
[✓] Status: No changes detected
[i] Received 26853 domains
[i] Target: http://sysctl.org/cameleon/hosts
[✓] Status: No changes detected
[i] Received 20567 domains
[i] Target: https://zeustracker.abuse.ch/blocklist.php?download=domainblocklist
[✓] Status: Retrieval successful
[i] Received 0 domains
[i] Target: https://s3.amazonaws.com/lists.disconnect.me/simple_tracking.txt
[✓] Status: No changes detected
[i] Received 34 domains
[i] Target: https://s3.amazonaws.com/lists.disconnect.me/simple_ad.txt
[✓] Status: No changes detected
[i] Received 2701 domains
[i] Target: https://hosts-file.net/ad_servers.txt
[✗] Status: Not found
[✗] List download failed: no cached list available
[i] Target: https://dbl.oisd.nl/
[✓] Status: Retrieval successful
[i] Received 1250033 domains
[i] Target: https://phishing.army/download/phishing_army_blocklist_extended.txt
[✓] Status: Retrieval successful
[i] Received 16263 domains
[i] Target: https://raw.githubusercontent.com/CHEF-KOCH/Audio-fingerprint-pages/master/AudioFp.txt
[✓] Status: Retrieval successful
[i] Received 371 domains
[i] Target: https://raw.githubusercontent.com/CHEF-KOCH/Canvas-fingerprinting-pages/master/Canvas.txt
[✓] Status: Retrieval successful
[i] Received 14335 domains
[i] Target: https://raw.githubusercontent.com/CHEF-KOCH/WebRTC-tracking/master/WebRTC.txt
[✓] Status: Retrieval successful
[i] Received 807 domains
[i] Target: https://raw.githubusercontent.com/deathbybandaid/piholeparser/master/Subscribable-Lists/ParsedBlacklists/AakList.txt
[✓] Status: Retrieval successful
[i] Received 5 domains
[i] Target: https://raw.githubusercontent.com/deathbybandaid/piholeparser/master/Subscribable-Lists/ParsedBlacklists/Prebake-Obtrusive.txt
[✓] Status: Retrieval successful
[i] Received 3 domains
[i] Target: https://jasonhill.co.uk/pfsense/ytadblock.txt
[✗] Status: https://jasonhill.co.uk/pfsense/ytadblock.txt (503)
[✗] List download failed: no cached list available
[✓] Storing downloaded domains in new gravity database
[✓] Building tree
[✓] Swapping databases
[i] Number of gravity domains: 1389690 (1301580 unique domains)
[i] Number of exact blacklisted domains: 1
[i] Number of regex blacklist filters: 0
[i] Number of exact whitelisted domains: 25
[i] Number of regex whitelist filters: 0
[✓] Cleaning up stray matter
[✗] DNS service is NOT running
I had whitelisted googleadservices because it broke a few things in android not to have it available, but that was well over a year ago, disabled it in the whitelist after pulling this output, fingers crossed it doesn't break things again.
When it came back with no matches, I was confused because I know I had explicitly whitelisted google.com, and it wasn't showing in my UI. Slipped my mind that I actually have two pi-hole installations; one is for my house (which I've been generating output from so far), the other is on a vpn server I created for our wireless devices when we leave the house (using a hosted virtual server because our ISP doesn't allow incoming connections). It appears that I removed the adblock/easylist entries from the house one but not from the vpn pi-hole, so I can clean that up easy enough. The general tendency to need to add a lot more domains to the whitelist than I ever had with non v5 versions of pi-hole is still of concern though. I think I'll wipe both of these pi-hole installs clean on lists and make sure they both match.
To that end, is there a way to export the collection of lists (whitelist, blacklist, gravity lists) from my home instance and import them into the VPN server instance or will I need to do it all by hand?
root@localhost:~# pihole -q -exact google.com
Exact match found in exact whitelist
google.com
Exact matches for google.com found in:
- Https://easylist-downloads.adblockplus.org/easylist.txt
- Https://easylist-downloads.adblockplus.org/easylist.txt
- Https://easylist-downloads.adblockplus.org/easylist.txt
- Https://easylist-downloads.adblockplus.org/easylist.txt
- Https://easylist-downloads.adblockplus.org/easylist.txt
- Https://easylist-downloads.adblockplus.org/easylist.txt
- Https://easylist-downloads.adblockplus.org/easylist.txt
- https://easylist-downloads.adblockplus.org/easylist.txt
- https://easylist-downloads.adblockplus.org/easylist.txt
- https://easylist-downloads.adblockplus.org/easylist.txt
- https://easylist-downloads.adblockplus.org/easylist.txt
- https://easylist-downloads.adblockplus.org/easylist.txt
- https://easylist-downloads.adblockplus.org/easylist.txt
- https://easylist-downloads.adblockplus.org/easylist.txt