~seirdy/seirdy.one

a81e185b7d78be34b02f7e99af94894811993ffa — Rohan Kumar 27 days ago b5e8d35
CI/Makefile: reduce zopfli compression even more

70 iterations seems to be the point at which we hit diminishing returns.
The combined size difference of all *.gz files with 70 iterations and
1000 iterations is 65 bytes.
2 files changed, 4 insertions(+), 11 deletions(-)

M Makefile
M content/posts/search-engines-with-own-indexes.md
M Makefile => Makefile +3 -9
@@ 13,9 13,8 @@ GEMINI_RSYNC_DEST = $(USER):$(GEMINI_ROOT)

OUTPUT_DIR = public
RSYNCFLAGS += -rlvz --zc=zstd
# max compression
ZOPFLI_ITERATIONS=500
ZOPFLI_ITERATIONS_LARGE=75
# compression has dimishing returns after this point
ZOPFLI_ITERATIONS=70

.PHONY: hugo
hugo: clean


@@ 65,13 64,8 @@ test: lint-css hint-devserver check-links

.PHONY: build
build: hugo
# gzip_static + max zopfli compression + brotli_static
ifndef NO_STATIC
	# compress RSS feeds separately, with fewer iterations since they're bigger
	find $(OUTPUT_DIR) -type f -name '*.xml' \
		| grep -v gemini \
		| xargs zopfli --i$(ZOPFLI_ITERATIONS_LARGE) --gzip
	find $(OUTPUT_DIR) -type f -name '*.html' -o -name '*.css' -o -name '*.txt' -o -name '*.asc' -o -name '*.webmanifest' -o -name "*.svg" \
	find $(OUTPUT_DIR) -type f -name '*.html' -o -name '*.css' -o -name '*.xml' -o -name '*.txt' -o -name '*.asc' -o -name '*.webmanifest' -o -name "*.svg" \
		| grep -v gemini \
		| xargs zopfli --i$(ZOPFLI_ITERATIONS) --gzip
	find $(OUTPUT_DIR) -type f -name '*.html' -o -name '*.css' -o -name '*.xml' -o -name '*.txt' -o -name '*.asc' -o -name '*.webmanifest' -o -name "*.svg" \

M content/posts/search-engines-with-own-indexes.md => content/posts/search-engines-with-own-indexes.md +1 -2
@@ 92,8 92,7 @@ These engines fail badly at a few important tests.
- [wbsrch](https://wbsrch.com/): In addition to its generalist search, it also has many other utilities related to domain name statistics. Failed multiple tests. Its index is a bit dated; it has an old backlog of sites it hasn't finished indexing. It also has several per-language indexes.
- [ExactSeek](https://www.exactseek.com/): small index, disproportionately dominated by big sites. Failed multiple tests. Allows submitting individual URLs for crawling, but requires entering an email address and receiving a newsletter. Webmaster tools seem to heavily push for paid <abbr title="search-engine optimization">SEO</abbr> options.
- [Meorca](https://meorca.com/): a search engine that claims not to "index pornography or illegal content websites". It also features a public blog with a marketplace and free games. Allows submitting URLs, but requires a full name, email, phone number, and "business name" to do so. Discovered in the seirdy.one access logs.

* [search.tl](http://www.search.tl/): Generalist search for one <abbr title="top-level domain">TLD</abbr> at a time (defaults to .com). I'm not sure why you'd want to always limit your searches to a single TLD, but now you can.[^7] There isn't any visible UI for changing the TLD for available results; you need to add/change the `tld` URL parameter. For example, to search .org sites, append `&tld=org` to the URL. It seems to be connected to [Amidalla](http://www.amidalla.de/), but Amidalla doesn't seem to currently be operational. Amidalla allows users to manually add URLs to its index and directory; I have yet to see if doing so impacts search.tl results.
- [search.tl](http://www.search.tl/): Generalist search for one <abbr title="top-level domain">TLD</abbr> at a time (defaults to .com). I'm not sure why you'd want to always limit your searches to a single TLD, but now you can.[^7] There isn't any visible UI for changing the TLD for available results; you need to add/change the `tld` URL parameter. For example, to search .org sites, append `&tld=org` to the URL. It seems to be connected to [Amidalla](http://www.amidalla.de/), but Amidalla doesn't seem to currently be operational. Amidalla allows users to manually add URLs to its index and directory; I have yet to see if doing so impacts search.tl results.

### Unusable engines, irrelevant results