Ojisan Seiuchi

Hazel deletes custom file icons, and a workaround

January 26, 2023

I use Hazel extensively for automating file management tasks on my macOS systems. Recently I found that Hazel aggressively matches an invisible system file that appears whenever you use a custom file or folder icon. I’ll describe the problem and present a workaround.

In a handful of directories, I have a rule that prevents users (me) from adding certain file types. So the rule just matches any file that is not an image, for example, and deletes it. This is all well and good until to try to add a custom icon to this directory. Since the file Icon? that gets created as a result is not an image, the Hazel rule dutifully deletes it.

In my first try to fix it, I tried to match the filename on Icon? but that didn’t work because the actual name is Icon$'\r', Thinking this special character \r would be too hard to match, I moved onto plan B.

Plan B in this case is to match on a particular bit of file metadata, the kMDItemDisplayName key. This is what Finder displays, but is not the actual file name, thus explaining Hazel’s inability to match it properly. So if we run:

mdls -n kMDItemDisplayName Icon$'\r'
# prints kMDItemDisplayName = "Icon?"

Now, we just need to strip the double quotes and match. So before we put this into a rule, with a Passes shell script criterion, let’s think about what we’re trying to do and how the logic of this criterion works. The criterion is considered positive when it returns a 0 exit code. But in our case, we are setting up a rule that says “If the file is not an image and is not a custom icon file, then delete it." So if the file matches Icon? then we need to return 1 to negate the second clause of this rule. Here’s how to do it:

R=$(mdls -n kMDItemDisplayName "$1" | cut -d ' ' -f 3 | tr -d \")
R=$(echo "$R" | tr '?' 'x')
[[ "$R" =~ ^Iconx$ ]] && exit 1 || exit 0

Explanation:

cut -d ' ' -f 3 cuts the result of the previous command into fields separated by the space character and returns the third field of that list.
tr -d \" strips the quotes from the resulting string.
In the next line tr '?' 'x' changes all ? characters to x because that makes regex matching in the next line easier. Bash regular expressions are quite limited, so it’s easier this way.

There may be a different/better way, but this works!

AwesomeTTS Anki add-on: Use Amazon Polly

December 26, 2022

anki

As its name implies, the AwesomeTTS Anki add-on is awesome. It’s nearly indispensable for language learners.

You can use it in one of two ways:

Subscribe on your own to the text-to-speech services that you plan to use and add those credentials to AwesomeTTS. (à la carte)
Subscribe to the AwesomeTTS+ service and gain access to these services. (prix fixe)

Because I had already subscribed to Google and Azure TTS before AwesomeTTS+ came on the scene, there was no reason for me to pay for the comprehensive prix fixe option. Furthermore, since I’ve never gone above the free tier on any of these services, it makes no sense for me to pay for something I’m already getting for free. For others, the convience of a one-stop-shopping experience probably makes the AwesomeTTS+ service worthwhile.

But the developers have chosen to lock Amazon Polly behind their prix fixe service. As an Amazon Web Services customer already, this makes no sense for me. AWS already knows how to bill me for services; so as with the Google and Azure services I mentioned previously, I have no intention to pay twice. But, as opposed to Google and Azure TTS, those of us who aren’t AwesomeTTS+ subscribers have been locked out of Amazon Polly.

Until now.

The rest of the post is a description of how I bypassed this limitation.

Prerequisites

Before modifying the AwesomeTTS code, you need to get a couple things out of the way first.

AWS user account

First, you will need to be an AWS user. I’m not going to go into depth with this. Start here.

Install the AWS CLI tools

For simplicity, we are going to access the Amazon Polly TTS via the command line toolset provided by AWS. To install, start here. After installing the AWS CLI tools, you will need to add your credentials as described here.

Modify the AwesomeTTS add-on code

On my system, the add-on path is ~/Library/Application Support/Anki2/addons21/1436550454. Within the awesometts directory within that path, you will find the files that you need to modify. Both are in the service directory.

Modifications to `languages.py`

Find the class definition for StandardLanguage. Change this:

class StandardVoice(Voice):
    def __init__(self, voice_data):
        self.language_code = voice_data['language_code']
        self.voice_key = voice_data['voice_key']
        self.voice_description = voice_data['voice_description']

to this:

class StandardVoice(Voice):
    def __init__(self, voice_data):
        self.language_code = voice_data['language_code']

        # we need the audio_language_code for Amazon Polly service
        self.audio_language_code = voice_data['audio_language_code']
        self.voice_key = voice_data['voice_key']
        self.voice_description = voice_data['voice_description']

This change is required by the Amazon service because in the AWS CLI call we need to specify the language code in the format specified by the audio_language_code key in the voice info.

Modifications to `amazon.py`

In the original code, they throw an exception when you aren’t an AwesomeTTS+ subscriber. We need to reverse the logic and formulate our own call. To do this, change the original code here:

def run(self, text, options, path):

    if not self.languagetools.use_plus_mode():
        raise ValueError(f'Amazon is only available on AwesomeTTS Plus')

    voice_key = options['voice']
    voice = self.get_voice_for_key(voice_key)

    rate = options['rate']
    pitch = options['pitch']

    self._logger.info(f'using language tools API')
    service = 'Amazon'
    voice_key = voice.get_voice_key()
    language = voice.get_language_code()
    options = {
        'pitch': pitch,
        'rate': rate
    }
    self.languagetools.generate_audio_v2(text, service, 'batch', language, 'n/a', voice_key, options, path)

to:

def run(self, text, options, path):
    # Nope ↓
    # raise ValueError(f'Amazon is only available on AwesomeTTS Plus')
    rate = options['rate']
    pitch = options['pitch']
    voice_key = options['voice']
    voice = self.get_voice_for_key(voice_key)
    if self.languagetools.use_plus_mode():
        self._logger.info(f'using language tools API')
        service = 'Amazon'
        voice_key = voice.get_voice_key()
        language = voice.get_language_code()
        options = {
            'pitch': pitch,
            'rate': rate
        }
        self.languagetools.generate_audio_v2(text, service, 'batch', language, 'n/a', voice_key, options, path)
    else:
        # roll your own, baby; needs AWS CLI installed
        # along with credentials therewith
        lang_code = voice.audio_language_code.replace('_', '-')
        voice_name = voice.get_key()
        (engine, voice_id) = (voice_name['engine'], voice_name['voice_id'])
        cmd = f'aws polly synthesize-speech --engine {engine} --language-code {lang_code}
        		--output-format mp3 --text "{text}" --voice-id {voice_id} "{path}"'
        cmd_list = shlex.split(cmd)
        resp = subprocess.run(cmd_list, stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL)

We also need to import shlex and subprocess which will be used to setup and execute the shell process that communicates with AWS:

import subprocess
import shlex

After you’ve made these changes, you should now have access to Amazon Polly via the AWS CLI call.

Although I’m a relative newcomer to Obsidian, I like what I see, especially the templating and data access functionality - both that provided natively and through the Templater and Dataview plugins. One missing piece is the ability to dynamically update the YAML-formatted metadata in the frontmatter of Obsidian’s Markdown documents. Several threads on both the official support forums and on r/ObsidianMD have addressed this; and there seems to be no real solution.

There are a couple features of the Dataview plugin for Obsidian that aren’t documented and are potentially useful. For the start of the week, use date(sow) and for the end of the week date(eow). Since there’s no documentation as of yet, I’ll venture a guess that they are locale-dependendent. For me (in Canada), sow is Monday. Since I do my weekly notes on Saturday, I have to subtract a couple days to point to them.

Most language learners are familiar with Forvo, a site that allows users to download and contribute pronunciations for words and phrases. For my Russian studies, I make daily use of the site. In fact, to facilitate my Anki card-making workflow, I am a paid user of the Forvo API. But that’s where the trouble started. When the Forvo API works, it works OK, often extremely slow. But lately, it has been down more than up.

Recently, someone asked a question on r/Anki about changing and existing cloze-type note to a regular note. Part of the solution involves stripping the cloze markup from the existing cloze’d field. A cloze sentence has the form Play {{c1::studid}} games. or Play {{c1::stupid::pejorative adj}} games. To handle both of these cases, the following regular expression will work. Just substitute for $1. {{c\d::([^:}]+)(?:::+[^}])}} However, the Cloze Anything markup is different. It uses ( and ) instead of curly braces.

I make a lot of Anki cards, so I’m on a constant quest to make the process more efficient. Like a lot of language-learners, I use images on my cards where possible in order to make the word or sentence more memorable. Process When I find an image online that I want to use on the card, I download it to ~/Documents/ankibound. A Hazel rule then grabs the image file and converts it to a .

Anki users are protective of their streak - the number of consecutive days they’ve done their reviews. Right now, for example, my streak is 621 days. So if you miss a day for whatever reason, not only do you have to deal with double the number of reviews, but you also deal with the emotional toll of having lost your streak. You can lose your streak for one of several reasons.

Welcome to Part III of a deep dive into my Anki language learning decks. In Part I I covered the principles that guide how I setup my decks and the overall deck structure. In the lengthy Part II I delved into my vocabulary deck. In this installment, Part III, we’ll cover my sentence decks. Principles First, sentences (and still larger units of language) should eventually take precedence in language study. What help is it to know the word for “tomato” in your L2, if you don’t know how to slice a tomato, how to eat a tomato, how to grow a tomato plant?

In Part I of my series on my Anki language-learning setup, I described the philosophy that informs my Anki setup and touched on the deck overview. Now I’ll tackle the largest and most complex deck(s), my vocabulary decks. First some FAQ’s about my vocabulary deck: Do you organize it as L1 → L2 or as L2 → L1, or both? Actually, it’s both and more. Keep reading. Do you have separate subdecks by language level, or source, or some other characteristic?

Hazel deletes custom file icons, and a workaround

AwesomeTTS Anki add-on: Use Amazon Polly

Prerequisites

AWS user account

Install the AWS CLI tools

Modify the AwesomeTTS add-on code

Modifications to `languages.py`

Modifications to `amazon.py`

Using fswatch to dynamically update Obsidian documents

Week functions in Dataview plugin for Obsidian

Scraping Forvo pronunciations

A regex to remove Anki's cloze markup

Anki: Insert the most recent image

Altering Anki's revlog table, or how to recover your streak

A deep dive into my Anki language learning: Part III (Sentences)

A deep dive into my Anki language learning: Part II (Vocabulary)

Prerequisites

AWS user account

Install the AWS CLI tools

Modify the AwesomeTTS add-on code

Modifications to languages.py

Modifications to amazon.py

Modifications to `languages.py`

Modifications to `amazon.py`