Fix your Anki streak - the script edition

Like many Anki users, I keep track of my review streaks because they motivate me to do my reviews each day. But since life gets in the way sometimes, I may miss my reviews in one or more decks. It has been years since I’ve neglected to do all of my reviews; but sometimes I forget to come back later in the day to finish up some of my decks. Since I like to have a clean review heatmap, I will “fix” my streak in a skipped deck.

Yes, this is “cheating”; but applied rarely, it gives me no moral qualms. YMMV.

I’ve described a manual process previously in which we execute queries directly against the Anki sqlite3 database. It works, but you have to deal with “bare metal” interaction with the database. There’s some risk involved. To make the process a little easier I’ve developed the following script. It automates the review date correction so that you don’t have to interact with the database directly. I’ll walk you through the process, which does require a little technical facility, but only a little.
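
For context, the manual fix boils down to shifting the relevant review timestamps in the revlog table back by one day (86,400,000 ms), which is exactly what the script below automates. A minimal sketch of that kind of query, run against a copy of the collection with the sqlite3 CLI (the timestamp bounds here are placeholders, not real values):

sqlite3 collection.anki2 "UPDATE revlog SET id = id - 86400000 WHERE id >= 1735275600000 AND id < 1735362000000;"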

WARNING
Backing up your collection before running this script is strongly recommended.
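
On macOS the simplest backup is to copy the collection file while Anki is not running; adjust the profile name (“Alan - Russian” below) to your own:

cp ~/Library/Application\ Support/Anki2/"Alan - Russian"/collection.anki2 ~/Desktop/collection.anki2.bak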

Prerequisites and installation

N.B.: I work most of the time on macOS and have almost no experience on the Windows ecosystem. I’m sure this could be adapted to work on Windows; but that’s for someone else to do.

  1. Ruby is installed by default on macOS; so you should be good there. If you want to be sure, you can check by going to the Terminal and typing which ruby. You should get something like:
➜  ~ which ruby
/Users/alan/.rbenv/shims/ruby
➜  ~
  2. You will need to install a couple of Ruby gems (a quick way to verify them is shown just after this list):
gem install sqlite3
gem install tzinfo
  3. Copy the Ruby script (see below for the entire listing).
  4. Install the script:
cd ~/Documents  # or wherever you want to put the script
pbpaste > anki_streak_fix.rb
  5. Your collection name is not going to be “Alan - Russian”, so use any text editor (e.g. TextEdit) to change that string in the code to your own profile name.
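
To confirm that both gems are visible to your Ruby, this one-liner should print OK (any load error means the gems went into a different Ruby than the one on your PATH):

ruby -e "require 'sqlite3'; require 'tzinfo'; puts 'OK'"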

At this point you should have everything you need installed on the system.

Usage

  1. Open Anki and do a couple of reviews in a deck where you missed your streak yesterday.
  2. Now quit Anki.
  3. Run the script from the Terminal:

cd ~/Documents  # or wherever you saved the script
ruby anki_streak_fix.rb "your_deck_name" --simulate

This should show you which cards will be moved to yesterday. If you’re satisfied with how that looks, then run the script without the --simulate flag.
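
For reference, the simulated run prints one line per matching note in this format; the note ID and timestamps below are invented for illustration:

Using timezone: America/Toronto
Would execute: UPDATE revlog for note 1650000000000 (1735275600000 to 1735362000000)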

Source code for the script

#!/usr/bin/env ruby

require 'sqlite3'
require 'optparse'
require 'time'
require 'tzinfo'

# Time.now.zone returns an abbreviation (e.g. "EST") on some systems,
# which TZInfo may not recognize; fall back to a fixed zone in that case.
def get_system_timezone
  begin
    TZInfo::Timezone.get(Time.now.zone)
  rescue TZInfo::InvalidTimezoneIdentifier
    puts "Unknown system time zone; defaulting to America/Toronto"
    TZInfo::Timezone.get('America/Toronto')
  end
end

class AnkiCollection
  def initialize(collection_name)
    base_path = "~/Library/Application Support/Anki2/"
    @path = base_path + collection_name + "/collection.anki2"
  end
  
  def collection_path
    File.expand_path(@path)
  end
end

class AnkiProcessor
  def initialize(deck_name, simulate: false)
    @deck_name = deck_name
    @simulate = simulate
    @db_path = AnkiCollection.new("Alan - Russian").collection_path
  end
  
  def process
    rid_string = generate_rid_string
    note_ids = fetch_reviewed_notes
    
    if note_ids.empty?
      puts "No notes found for today in deck '#{@deck_name}'"
      return
    end
    
    process_notes(note_ids, rid_string)
  end
  
  private
  
  def generate_rid_string
    system_timezone = get_system_timezone
    puts "Using timezone: #{system_timezone.identifier}"
    
    today = Time.now
    local_midnight = system_timezone.local_to_utc(Time.new(today.year, today.month, today.day))
    start_time = local_midnight.to_i * 1000
    end_time = (local_midnight + 86400).to_i * 1000
    
    "rid:#{start_time}:#{end_time}"
  end
  
  def fetch_reviewed_notes
    query = <<-SQL
      SELECT DISTINCT notes.id
      FROM cards
      JOIN notes ON cards.nid = notes.id
      JOIN decks ON cards.did = decks.id
      JOIN revlog ON cards.id = revlog.cid
      WHERE decks.name COLLATE NOCASE = ?
      AND date(revlog.id/1000, 'unixepoch', 'localtime') = date('now', 'localtime')
      ORDER BY notes.id;
    SQL
    
    begin
      db = SQLite3::Database.new(@db_path)
      db.results_as_hash = true
      db.execute(query, @deck_name)
    rescue SQLite3::Exception => e
      puts "Database error: #{e.message}"
      []
    ensure
      db&.close
    end
  end
  
  def process_notes(notes, rid_string)
    # Extract start and end times (ms since epoch) from rid_string and
    # convert them to integers so SQLite compares them numerically
    # against the integer revlog.id column
    start_time = rid_string.split(':')[1].to_i
    end_time = rid_string.split(':')[2].to_i
    
    begin
      db = SQLite3::Database.new(@db_path)
      
      notes.each do |row|
        note_id = row['id']
        
        if @simulate
          puts "Would execute: UPDATE revlog for note #{note_id} (#{start_time} to #{end_time})"
        else
          update_query = <<-SQL
            UPDATE revlog
            SET id = id - 86400000
            WHERE id IN (
              SELECT r.id
              FROM revlog r 
              INNER JOIN cards c ON r.cid = c.id
              INNER JOIN notes n ON n.id = c.nid
              WHERE n.id = ?
                AND r.id >= ?
                AND r.id < ?
            );
          SQL
          
          db.execute(update_query, [note_id, start_time, end_time])
          puts "Note date updated successfully for #{note_id}"
        end
      end
    rescue SQLite3::Exception => e
      puts "Database error: #{e.message}"
    ensure
      db&.close
    end
  end
end

# Parse command line arguments
options = {simulate: false}
parser = OptionParser.new do |opts|
  opts.banner = "Usage: #{$0} [options] DECK_NAME"
  opts.on('-s', '--simulate', 'Simulate the update (print only)') do |s|
    options[:simulate] = s
  end
end

parser.parse!

if ARGV.empty?
  puts parser.help
  exit 1
end

# Run the processor
processor = AnkiProcessor.new(ARGV[0], simulate: options[:simulate])
processor.process

If you have any difficulties or you have ideas for improvements, I can try to help. See my contact page.

An API (sort of) for adding links to ArchiveBox

I use ArchiveBox extensively to save web content that might change or disappear. While a REST API is apparently coming eventually, it doesn’t appear to have been merged into the main branch yet. So I cobbled together a little application to archive links via a POST request. It takes advantage of the archivebox command line interface. If you are impatient, you can skip to the full source code. Otherwise I’ll describe my setup to provide some context.

My ArchiveBox server

The ArchiveBox instance I’m running is on a Debian 12 box on my LAN, one that I use for a host of utility services in my home lab. I installed it in a Docker container. To run the instance:

cd ~/archivebox/data
docker run -v $PWD:/data -p 8000:8000 -it archivebox/archivebox
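
A variant I find convenient, though strictly optional: give the container a fixed name with --name and run it detached with -d, so the exec commands below don’t depend on Docker’s generated name:

cd ~/archivebox/data
docker run -d --name archivebox -v $PWD:/data -p 8000:8000 archivebox/archivebox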

Using the ArchiveBox command line interface

To archive a link, we can run archivebox add your_url; but since we are running in a Docker container, it becomes docker exec --user=archivebox CONTAINER_NAME /bin/bash -c 'archivebox add your_url' (note the quotes around the command passed to -c). This FastAPI application is essentially a wrapper around that CLI functionality.
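
For example, with the container name from my setup (see below), a one-off archival from the host looks like this:

docker exec --user=archivebox unruffled_jones /bin/bash -c 'archivebox add https://example.com'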

FastAPI ArchiveBox server API

In the full source code below you’ll notice that I have hard-coded the CONTAINER_NAME. Yours may be different and can be found with:

docker ps | awk '
NR == 1 {
    # Print the header row
    printf "%-20s %-25s %-25s %s\n", $1, $2, $7, $NF
}
$0 ~ /^[a-z0-9]/ {
    # Print container details
    printf "%-20s %-25s %-25s %s\n", $1, $2, $7, $NF
}'

In my case the container turns out to be named unruffled_jones, so that’s what I use for CONTAINER_NAME.

To add a link via the /archive POST endpoint:

curl -X POST http://my_ip_address:9000/archive \
  -H "Content-Type: application/json" \
  -d '{"url": "https://example.com", "tags": ["test"]}'

This returns, e.g.:

{
  "job_id": "1f532ca8-1466-414e-8d1e-b6fc9fe526b8",
  "status": "in_progress",
  "url": "https://example.com",
  "start_time": "2024-12-25T05:45:24.541473",
  "end_time": null,
  "duration_seconds": null,
  "error": null,
  "output": null
}

If you want to check the progress of the archival job, you can query the /status endpoint with the job_id returned at submission:

curl http://my_ip_address:9000/status/1f532ca8-1466-414e-8d1e-b6fc9fe526b8
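
Once the job finishes, the same endpoint returns the completed record. Based on the JobStatus model in the source below, a successful run should look roughly like this (values illustrative):

{
  "job_id": "1f532ca8-1466-414e-8d1e-b6fc9fe526b8",
  "status": "completed",
  "url": "https://example.com",
  "start_time": "2024-12-25T05:45:24.541473",
  "end_time": "2024-12-25T05:45:58.103291",
  "duration_seconds": 33.56,
  "error": null,
  "output": "..."
}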

Full source code

from fastapi import FastAPI, HTTPException, BackgroundTasks
from pydantic import BaseModel, HttpUrl
from typing import List, Dict, Optional
import asyncio
import logging
import uvicorn
from datetime import datetime
import uuid
import shlex
from collections import defaultdict

logging.basicConfig(
    level=logging.INFO,
    format='%(asctime)s - %(name)s - %(levelname)s - %(message)s'
)
logger = logging.getLogger('archivebox-service')

app = FastAPI(title="ArchiveBox Service")

CONTAINER_NAME = "unruffled_jones"

# In-memory storage for job status
jobs: Dict[str, Dict] = defaultdict(dict)

class ArchiveRequest(BaseModel):
    url: HttpUrl
    tags: List[str] = []

class JobStatus(BaseModel):
    job_id: str
    status: str
    url: str
    start_time: datetime
    end_time: Optional[datetime] = None
    duration_seconds: Optional[float] = None
    error: Optional[str] = None
    output: Optional[str] = None

async def run_in_container(cmd: List[str]) -> tuple[str, str]:
    # Quote each argument so URLs containing shell metacharacters
    # survive being passed through /bin/bash -c
    docker_cmd = [
        "docker", "exec",
        "--user=archivebox",
        CONTAINER_NAME,
        "/bin/bash", "-c",
        " ".join(shlex.quote(c) for c in cmd)
    ]
    logger.info(f"Running command: {' '.join(docker_cmd)}")
    
    process = await asyncio.create_subprocess_exec(
        *docker_cmd,
        stdout=asyncio.subprocess.PIPE,
        stderr=asyncio.subprocess.PIPE
    )
    
    stdout, stderr = await process.communicate()
    if process.returncode != 0:
        raise Exception(stderr.decode())
    
    return stdout.decode(), stderr.decode()

async def archive_url_task(job_id: str, url: str, tags: List[str]):
    """Background task for archiving URLs"""
    try:
        cmd = ["archivebox", "add"]
        if tags:
            tag_str = ",".join(tags)
            cmd.extend(["--tag", tag_str])
        cmd.append(str(url))
        
        start_time = datetime.now()
        stdout, stderr = await run_in_container(cmd)
        end_time = datetime.now()
        duration = (end_time - start_time).total_seconds()
        
        jobs[job_id].update({
            "status": "completed",
            "end_time": end_time,
            "duration_seconds": duration,
            "output": stdout
        })
        
    except Exception as e:
        jobs[job_id].update({
            "status": "failed",
            "end_time": datetime.now(),
            "error": str(e)
        })
        logger.error(f"Job {job_id} failed: {e}")

@app.post("/archive", response_model=JobStatus)
async def start_archive(request: ArchiveRequest, background_tasks: BackgroundTasks):
    """Start an archival job and return immediately with a job ID"""
    job_id = str(uuid.uuid4())
    start_time = datetime.now()
    
    # Initialize job status
    jobs[job_id] = {
        "job_id": job_id,
        "status": "in_progress",
        "url": str(request.url),
        "start_time": start_time
    }
    
    # Schedule the archival task
    background_tasks.add_task(archive_url_task, job_id, str(request.url), request.tags)
    
    return JobStatus(**jobs[job_id])

@app.get("/status/{job_id}", response_model=JobStatus)
async def get_job_status(job_id: str):
    """Get the status of a specific job"""
    if job_id not in jobs:
        raise HTTPException(status_code=404, detail="Job not found")
    return JobStatus(**jobs[job_id])

@app.get("/health")
async def health_check():
    """Simple health check endpoint"""
    try:
        stdout, stderr = await run_in_container(["archivebox", "version"])
        return {
            "status": "healthy", 
            "archivebox": "available",
            "container": CONTAINER_NAME,
            "version": stdout.strip()
        }
    except Exception as e:
        logger.error(f"Health check failed: {str(e)}")
        return {"status": "unhealthy", "error": str(e)}

if __name__ == "__main__":
    import argparse
    
    parser = argparse.ArgumentParser(description='ArchiveBox API Service')
    parser.add_argument('--host', default='0.0.0.0', help='Host to bind to')
    parser.add_argument('--port', type=int, default=9000, help='Port to bind to')
    parser.add_argument('--container', default=CONTAINER_NAME, 
                       help='Docker container name for ArchiveBox')
    parser.add_argument('--log-level', default='INFO', 
                       choices=['DEBUG', 'INFO', 'WARNING', 'ERROR', 'CRITICAL'],
                       help='Logging level')
    
    args = parser.parse_args()
    
    if args.container:
        CONTAINER_NAME = args.container
    
    logger.setLevel(args.log_level)
    logger.info(f"Starting ArchiveBox service on {args.host}:{args.port}")
    logger.info(f"Using ArchiveBox container: {CONTAINER_NAME}")
    
    uvicorn.run(app, host=args.host, port=args.port)
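
To start the service and check that it can reach ArchiveBox, something like the following should work (the script filename is whatever you saved it as; archivebox_api.py is just my choice):

python archivebox_api.py --container unruffled_jones --port 9000
curl http://my_ip_address:9000/health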

A Keyboard Maestro action to save bookmarks to Espial

So this is a little esoteric, but it meets a need I encountered; and it may meet yours if you use Espial, Keyboard Maestro and are on macOS.

For several years I’ve been using Espial, a bookmark manager that looks and feels like Pinboard but is both self-hosted and drama-free. Espial is easy to set up, stores its data in a comprehensible sqlite database, and has an API, which came in handy in solving the problem I encountered.

Louisiana and the Ten Commandments

Recently, the governor of Louisiana signed a bill requiring all public school classrooms in the state to display a poster-sized copy of the Ten Commandments. In the “Beforetimes” (before the current partisan Supreme Court took shape), this would have been struck down immediately as a violation of the Establishment Clause of the First Amendment. This bill is a clear violation of that clause. I imagine that the justices will dance around the cultural and historical significance of the document without stopping to consider the state’s motives in passing this law. While the proponents of the Ten Commandments aren’t wrong about its historical significance, the U.S. Constitution and its Amendments arguably hold more importance from the secular perspective that one must adopt in a public school.

Improving vegetable seed germination with chemical pretreatment

Some vegetable seeds, particularly many exotic chilli pepper varieties and some Asian eggplants, are tricky to germinate. After trying the obvious things - cold stratification (cold-induced forced dormancy), abundant moisture, high humidity, and temperatures over 80°F - I’ve found that some seeds simply do not germinate with much success at all. But having read a number of articles on this problem, we decided to try an intensive chemical pretreatment to see if we could achieve better results. And it looks successful.

A quick word on ATtiny 1-series interrupts

The Atmel AVR 8-bit microcontrollers have always been a favourite for tinkering; and the massive popularity of the Arduino, based on the ATmega 168 and 328 MCUs, introduced a lot of hobbyists to this series. The companion ATtiny series was the poor stepchild of the ATmega controllers to an extent - useful for small projects but often quite limited. However, the acquisition of Atmel by Microchip Technology in 2016 ushered in a new series of MCUs bearing the same ATtiny moniker but much more capable and innovative. They have been around for a while now, but many hobbyists are just beginning to poke around with these capable new MCUs.

FreeRTOS stack size on ESP32 - words or bytes?

Although FreeRTOS is an indispensable tool for working on anything more than the simplest application on the ESP32, there are some difficulties to master, such as multitasking. Multitasking under FreeRTOS is accomplished by creating tasks with xTaskCreate() or xTaskCreatePinnedToCore(). In both of these calls, one of the parameters is uxStackDepth, which is the allocated stack size for the task. The FreeRTOS documentation on the subject is clear about the units for uxStackDepth:

Our vermiculture process: A sustainable contribution

Several people have asked me how we manage a very productive vegetable garden; so I’ve written this post as a brief description of one aspect of our approach - vermiculture.

One of our overarching family goals is sustainable living. It’s basically about leaving a small footprint. A practical component of this philosophical stance is dealing with food waste. We handle kitchen waste with a combination of bokashi composting and vermicomposting (also known as vermiculture). It’s not for the faint-of-heart, and some are horrified to learn that I keep thousands - possibly hundreds of thousands - of worms in our basement. But some have asked me to describe our process, so this article is meant just to document it. There is a lot of art and science to vermiculture, and this is not meant to be a definitive guide.

Some useful macOS date formatting and manipulation snippets

This isn’t an exhaustive list, just a handful of date-related snippets I use in scripts.

Month name to integer

When you need to convert a month name to its integer representation (e.g. ‘April’ → 04):

#!/bin/bash

month="April"
numeric_month=$(date -j -f "%b" "$month" "+%m" 2>/dev/null)
if [ -n "$numeric_month" ]; then
    echo "$numeric_month"
else
    echo "Invalid month abbreviation"
fi

# prints "04"

It also works with abbreviated months like “Apr”.

If you have gdate installed, then it’s easier:
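
A minimal sketch with GNU date (gdate from Homebrew’s coreutils); its -d option parses natural-language dates, so appending a day number pins the month name to a concrete date in the current year:

#!/bin/bash

month="April"
# GNU date resolves "April 1" to April 1 of the current year
numeric_month=$(gdate -d "$month 1" "+%m" 2>/dev/null)
if [ -n "$numeric_month" ]; then
    echo "$numeric_month"   # prints "04"
else
    echo "Invalid month name"
fi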

An approach to interleaved and variable musical practice: Tools and techniques

“How do you get to Carnegie Hall” goes the old joke. “Practice, practice, practice.” But of course there’s no other way. If the science of talent development has taught us anything over the last fifty years, it’s that there is no substitute for strategic practice. Some even argue that innate musical abilities don’t exist. Whether it’s nature, nurture, or both, show me a top-notch musician and I’ll show you a person who has learned to practice well. Here we’ll take a dive into a set of practice techniques that I’ve developed, along with tools to realize them in the practice room.