Selected Thoughts

Exploring APIs and data structures with Jupyter notebooks

2020-03-02T00:00:00+01:00

Recently a colleague shared a useful technique for exploring Web APIs with me: Jupyter notebooks.

Previously I used to use Bash scripts and curl for tasks like this. Other colleagues preferred GUI tools like Postman.

Jupyter brings both worlds together:

You can write code and have access to Python libraries
- Requests library for HTTP requests
- Pandas library for data analysis
You get documentation to share with your colleagues (and your future self)
- GitHub will render Jupyter notebooks as static HTML
- You can include images, tables, and even interactive elements like maps

By the way: This post was written in a Jupyter notebook itself.

Interested? Let’s get started by setting everything up.

The first step is (of course) to install the Jupyter package:

pip install jupyterlab

Note: Depending on when you read this (it was written in early 2020), you might have to check if pip is the Python 3.x version of Python or still the legacy Python 2.7 version. On my machine I had to use the pip3 command that Homebrew created. If that’s the case, the Python executable is most likely also named python3. To make it less confusing, I’ll be using the regular pip and python commands in this post.

Next you can start Jupyter:

python -m jupyterlab

You’ll be greeted with a Web UI like this:

In this post I’ll be using some Python libraries. Here are it’s version number so that you can recognize if your version differ:

import pkg_resources

[pkg_resources.get_distribution(lib) for lib in ['jupyterlab', 'requests', 'curlify', 'pandas', 'nbconvert']]

[jupyterlab 1.2.6 (/usr/local/lib/python3.7/site-packages),
 requests 2.21.0 (/usr/local/lib/python3.7/site-packages),
 curlify 2.2.1 (/usr/local/lib/python3.7/site-packages),
 pandas 1.0.0 (/usr/local/lib/python3.7/site-packages),
 nbconvert 5.6.1 (/usr/local/lib/python3.7/site-packages)]

Getting started with Requests

The first library I want to introduce is Requests, the de facto standard HTTP library for Python.

If you haven’t done so, you should install it using:

pip install requests

Then you are able to load it:

import requests

Let request something simple to try out requests (no pun intended):

response = requests.request('GET', 'http://httpbin.org/json')
response.status_code

To get a pretty output from the JSON data, a quick helper function comes in handy:

import json
def pp(item):
    print(json.dumps(item, indent=2))

pp(response.json())

{
  "slideshow": {
    "author": "Yours Truly",
    "date": "date of publication",
    "slides": [
      {
        "title": "Wake up to WonderWidgets!",
        "type": "all"
      },
      {
        "items": [
          "Why <em>WonderWidgets</em> are great",
          "Who <em>buys</em> WonderWidgets"
        ],
        "title": "Overview",
        "type": "all"
      }
    ],
    "title": "Sample Slide Show"
  }
}

If you want to print the response headers, you need to remember that Headers is a CaseInsensitiveDict structure. Wrappingg it in a dict() function enables you to print it using the json.dumps function.

pp(dict(response.headers))

{
  "Date": "Wed, 04 Mar 2020 07:05:33 GMT",
  "Content-Type": "application/json",
  "Content-Length": "429",
  "Connection": "keep-alive",
  "Server": "gunicorn/19.9.0",
  "Access-Control-Allow-Origin": "*",
  "Access-Control-Allow-Credentials": "true"
}

You can get a curl version of your request by using the curlify package:

pip install curlify

import curlify
print(curlify.to_curl(response.request))

curl -X GET -H 'Accept: */*' -H 'Accept-Encoding: gzip, deflate' -H 'Connection: keep-alive' -H 'User-Agent: python-requests/2.21.0' http://httpbin.org/json

Using Pandas to explore JSON documents

Pandas is a data analysis and manipulation library that’s popular in the Data Science community. I find it very useful to explore JSON documents.

Let’s first install the package (you might need to use pip3):

pip install pandas

Now let’s take a look how it would work without pandas:

r = requests.request('GET', 'https://api.github.com/users/janahrens/repos')
json = r.json()
json.__class__

list

We now know that the call returns a JSON list. Let’s examine what items this list has by looking at the first one.

json[0].keys()

dict_keys(['id', 'node_id', 'name', 'full_name', 'private', 'owner', 'html_url', 'description', 'fork', 'url', 'forks_url', 'keys_url', 'collaborators_url', 'teams_url', 'hooks_url', 'issue_events_url', 'events_url', 'assignees_url', 'branches_url', 'tags_url', 'blobs_url', 'git_tags_url', 'git_refs_url', 'trees_url', 'statuses_url', 'languages_url', 'stargazers_url', 'contributors_url', 'subscribers_url', 'subscription_url', 'commits_url', 'git_commits_url', 'comments_url', 'issue_comment_url', 'contents_url', 'compare_url', 'merges_url', 'archive_url', 'downloads_url', 'issues_url', 'pulls_url', 'milestones_url', 'notifications_url', 'labels_url', 'releases_url', 'deployments_url', 'created_at', 'updated_at', 'pushed_at', 'git_url', 'ssh_url', 'clone_url', 'svn_url', 'homepage', 'size', 'stargazers_count', 'watchers_count', 'language', 'has_issues', 'has_projects', 'has_downloads', 'has_wiki', 'has_pages', 'forks_count', 'mirror_url', 'archived', 'disabled', 'open_issues_count', 'license', 'forks', 'open_issues', 'watchers', 'default_branch', 'permissions'])

With the knowledge of available fields, we could now use standard Python methods to further explore the data.

This process gets a lot easier with Pandas and it’s json_normalize function. With json_normalize the data gets parsed into a DataFrame, which is a core data structure for “Two-dimensional, size-mutable, potentially heterogeneous tabular data”. In other words: It represents the data as a table.

from pandas import json_normalize
df = json_normalize(r.json())
df.shape

(30, 98)

Calling the .shape method is a good first step to explore the data. It shows that our DataFrame/table has 30 rows and 98 columns.

Let’s see what those columns are:

df.columns

Index(['id', 'node_id', 'name', 'full_name', 'private', 'html_url',
       'description', 'fork', 'url', 'forks_url', 'keys_url',
       'collaborators_url', 'teams_url', 'hooks_url', 'issue_events_url',
       'events_url', 'assignees_url', 'branches_url', 'tags_url', 'blobs_url',
       'git_tags_url', 'git_refs_url', 'trees_url', 'statuses_url',
       'languages_url', 'stargazers_url', 'contributors_url',
       'subscribers_url', 'subscription_url', 'commits_url', 'git_commits_url',
       'comments_url', 'issue_comment_url', 'contents_url', 'compare_url',
       'merges_url', 'archive_url', 'downloads_url', 'issues_url', 'pulls_url',
       'milestones_url', 'notifications_url', 'labels_url', 'releases_url',
       'deployments_url', 'created_at', 'updated_at', 'pushed_at', 'git_url',
       'ssh_url', 'clone_url', 'svn_url', 'homepage', 'size',
       'stargazers_count', 'watchers_count', 'language', 'has_issues',
       'has_projects', 'has_downloads', 'has_wiki', 'has_pages', 'forks_count',
       'mirror_url', 'archived', 'disabled', 'open_issues_count', 'license',
       'forks', 'open_issues', 'watchers', 'default_branch', 'owner.login',
       'owner.id', 'owner.node_id', 'owner.avatar_url', 'owner.gravatar_id',
       'owner.url', 'owner.html_url', 'owner.followers_url',
       'owner.following_url', 'owner.gists_url', 'owner.starred_url',
       'owner.subscriptions_url', 'owner.organizations_url', 'owner.repos_url',
       'owner.events_url', 'owner.received_events_url', 'owner.type',
       'owner.site_admin', 'permissions.admin', 'permissions.push',
       'permissions.pull', 'license.key', 'license.name', 'license.spdx_id',
       'license.url', 'license.node_id'],
      dtype='object')

The list of columns itself isn’t a very good demonstration of Pandas analysis capabilities. It gets more useful if we use it’s sorting and filtering capabilities.

Let’s find out what GitHub repositories have the most stars and only select some of the columns:

df.sort_values(by='stargazers_count', ascending=False).head()[['name', 'created_at', 'size', 'language', 'stargazers_count']]

	name	created_at	size	language	stargazers_count
24	threema-protocol-analysis	2014-03-16T14:38:56Z	311	TeX	17
11	ipconfig-http-server	2014-05-12T06:15:38Z	152	C	6
29	yesod-oauth-demo	2012-05-15T21:02:29Z	216	Haskell	5
27	xing-api-haskell	2013-01-28T07:28:41Z	508	Haskell	5
4	dotfiles	2011-09-05T09:39:29Z	2337	Shell	5

We can also request entries in the table using the .iloc method. The table can be transformed (change rows and columns) with the .T method:

df.iloc[[0]].T

	0
id	207344689
node_id	MDEwOlJlcG9zaXRvcnkyMDczNDQ2ODk=
name	alb-fargate-demo
full_name	JanAhrens/alb-fargate-demo
private	False
...	...
license.key	NaN
license.name	NaN
license.spdx_id	NaN
license.url	NaN
license.node_id	NaN

98 rows × 1 columns

Pandas can do a lot more and it’s definetely worth to take a look at the 10 minutes to pandas guide.

Bonus: Generating a blog post from a Jupyter notebook

My blog gets generated by feeding Markdown files into Jekyll. Using nbconvert I was able to convert this notebook into a Markdown file. The only thing I had to add manually was the header for Jekyll. The rest of this post is directly from the notebook.

First install the nbconvert package

pip install nbconvert

Then you can invoke nbconvert on this file:

python -m nbconvert files/explore-apis.ipynb –to markdown –stdout > _posts/2020-03-02-explore-apis.markdown

Videos as podcasts

2019-11-03T00:00:00+01:00

Today I discovered that you can easily convert YouTube (and a few more) videos to audio files using youtube-dl.

I’m found that these parameters work best for me:

$ youtube-dl -x --audio-format mp3 "https://www.youtube.com/watch?v=2u0sNRO-QKQ"

If you’re an Overcast premium subscriber you can easily upload the extracted files and listen on the go.

I hope you find this useful, too.

Small Docker images with embedded go binaries

2019-08-10T00:00:00+02:00

I recently wanted to include Go binaries in a Docker image and tried to make the image as small as possible. My goal was to speed up its execution and download time.

The solution is actually pretty simple:

FROM golang:1.12.7-alpine3.10 AS builder
RUN apk --no-cache add git
RUN go get github.com/wakeful/yaml2json
RUN go get github.com/santhosh-tekuri/jsonschema/cmd/jv

FROM alpine:3.10
WORKDIR /root/
COPY --from=builder /go/bin/yaml2json /usr/local/bin
COPY --from=builder /go/bin/jv /usr/local/bin

The trick is to use Dockers’ multi-stage build feature. While Docker is building the image, it actually creates another image called “builder” first. This image gets discarded after the binaries got copied over to the final image and thus the final image doesn’t need the entire go build system.

One caveat is that the builder image and the final image have to be binary compatible. In my first attempt I used the golang-1.12.7 image for the builder image. The binaries got build successfully but wouldn’t run in the final image. The reason for that was that this image was using Debian buster under the hood and not Alpine Linux. Fortunately there’s also the golang-1.12.7-alpine3.10 image available on Dockerhub that is based on the same Alpine version that the final image uses.

If you want to experiment with this, you can find the complete code in my multi-stage-go repository.

Hello Signal, Goodbye PGP

2019-08-04T00:00:00+02:00

After years of maintaining my key I decided to drop PGP and move on to Signal. In this post I’ll explain why I made this decision.

For a long time I thought that I needed to maintain a PGP key. It was just something that you had to do if you wanted to be serious about your online life.

I went to great lengths to create the perfect gpg keypair. My master key got stored offline in a secure location and I used a YubiKey to access my private subkey. To establish trust I went to a keysigning party at the 30C3 and managed to collect 43 signatures. Every year I went through the process of renewing my key to ensure that if I’d loose access some day, it won’t be valid forever. The next renewal is due later this year and I’m gonna give it a pass. It’s time to say goodbye to my old friend B911 E6A2 2B4F 3B5F.

Let’s face it: Almost nobody uses PGP. Its adoption rate is very low despite being around since 1991. Maybe it’s because it’s too complicated, even though tools like GPG Suite make it very easy. The problem remains that not enough people use it 28 years after its initial release to make a difference.

Encrypting email isn’t a good idea in general. Put all complications with PGP and its adopting rate aside, you can only encrypt email bodies. Still, all metadata is unencrypted. Your provider and other parties can see who is communicating with whom and which email subject is used.

Another problem is the lack of forward secrecy. Once your PGP key gets compromised, all your past communication is also (potentially) compromised. Moxie Marlinspike explains the problem in a talk where he introduces the Signal protocol.

If you want to read more about the many problems of PGP, I can recommend The PGP Problem.

That’s why I decided to use Signal instead. It’s an open-source project that uses modern cryptography (elliptic curve) and provides forward secrecy. Signal is available as a mobile app for Android and iOS. You can also use Signal on Linux, Windows and macOS - so basically on every platform. You don’t need to do anything special to get started. Just download the app, register an account and you’re all set. No key generation, no key exchange. Signal trusts the keys of your contact on first use (TOFU) and alerts you if it changes.

When I migrated to Signal, I wanted to provide everyone with the opportunity to contact me. As it requires a phone number to register, I would have had to publish my private number (I wasn’t willing to do that). Luckily, I found a well-written guide that explains how to use Signal without giving out your phone number. In that guide there are a couple of ways listed on how you can obtain an alternative phone number. I ended up using satellite which is a VoIP app that is only available in Germany. Even though satellite doesn’t support text messages, I could still register the account because Signal has a phone call fallback for verification.

I recommend that you give Signal a try and stop worrying about PGP. You can send me your feedback at +49 156 7856 2789.

cron is dead, long live launchd!

2017-01-13T00:00:00+01:00

Now that I finally created my tarsnap backup script, how do I execute it regularly? Oh, I know: My Mac is just an Unix system, I’ll use cron!

At least that’s what I thought I’ll do. After a few attempts to get cron to do the job, I learned that there’s a better way on macOS: launchd.

launchd does a lot more than executing scripts cron-style. Like systemd on Linux, launchd is a replacement for a lot of old school Unix tools, like cron, inetd, init, etc.

At it’s core, launchd distincts daemons and agents. Dameons are processes that always run in the background, while agents describe regular jobs that are to be executed on certain events. There are a lot of different events to choose from. For example you can trigger an agent, when a device gets mounted, when a file gets created, or when a certain time arrives.

What really helped me in learning how to write my first launchd agent was launchd.info. Unlike the Apple documentation, it contains useful snippets and concise explanations. I highly recommend that you also have a look at the launchd agents that some of your applications put into ~/Library/LaunchAgents.

Below you can see the agent that I ended up creating. You can learn how to load/unload agents and about the meaning of the different options at launchd.info.

If you’re testing your script and you don’t want to wait for the next hour to arrive, you can start it immediately with launchctl start eu.jan-ahrens.tarsnap.

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE plist PUBLIC "-//Apple//DTD PLIST 1.0//EN" "http://www.apple.com/DTDs/PropertyList-1.0.dtd">
<plist version="1.0">
<dict>
    <key>Label</key>
    <string>eu.jan-ahrens.tarsnap</string>
	<key>EnvironmentVariables</key>
	<dict>
		<key>PATH</key>
		<string>/bin:/usr/bin:/usr/local/bin</string>
	</dict>
    <key>ProgramArguments</key>
    <array>
	<string>/bin/bash</string>
	<string>/Users/jan/bin/run-tarsnap-backup</string>
    </array>
    <key>StartInterval</key>
    <integer>3600</integer>
    <key>StandardOutPath</key>
    <string>/Users/jan/.tarsnap.log</string>
    <key>StandardErrorPath</key>
    <string>/Users/jan/.tarsnap.log</string>
    <key>KeepAlive</key>
    <dict>
	<key>NetworkState</key>
	<true/>
     </dict>
    <key>ExitTimeout</key>
    <integer>900</integer>
    <key>Nice</key>
    <integer>10</integer>
</dict>
</plist>

P.S.: cron itself is implemented as a launchd daemon. You can find it at /System/Library/LaunchDaemons/com.vix.cron.plist.

The German income tax algorithm

2015-12-13T00:00:00+01:00

You might have heard it before: The German tax system is complicated. There are a multitude of rules that can be applied. But how complicated is it exactly? And how the heck is my income tax calculated? I recently started to find answers to these questions.

The “Lohnsteuer”, as it’s called in German, used to be calculated with the help of a simple table - the “Lohnsteuer-Tabelle”. You could look up your annual salary to find out how much taxes you had to pay. In 2004 this changed (The German Wikipedia has some details on this - unfortunately only in German).

The interesting thing about it is that the German government decided to replace the table with an algorithm. I think, their intention was quite obvious: Algorithms don’t leave much room for interpretation.

Publishing an algorithm is a good idea, but how exactly do you do this? To start with, you could publish an implementation in a well known language - for example in Java. Java developers would like this. They could simply use this implementation.

However, luckily not everyone uses Java. Having only one implementation in a popular language is problematic for developers that need to use other languages (think about iOS apps). The government agency that published the algorithm could of course release implementations in different languages. This leads to a scalability problem. To name a few: Java, C, Ruby, Erlang, Lua, Go, Scheme, Haskell, JavaScript, VisualBasic, C#, SmallTalk and probably much more. To complicate things, implementing the algorithm isn’t a onetime task. Laws change and with them the algorithm. The solution that was chosen by the German tax office is both, fascinating and shocking: They’re publishing flowcharts.

Yup! You heared right. They’re publishing a thirty-something page document that includes a list of input parameters, a list of output parameters and a flowchart.

To be honest: This was the first time I ever saw a flowchart outside of an education context. “Are they really expecting people to implement their algorithm by reading the flowchart?”, I thought. It seems like an error-prone and tedious task: translate thirty pages of flowchart blocks into an implemenation for the language of your choice.

And it’s true: The task is error-prone and tedious. That’s why there’s a table at the end of the flowchart document. Ironically, that table is quite similar to the “Lohnsteuer-Tabelle” that was used before.

Then it got me thinking. Isn’t there a better way to communicate an algorithm in a way that it can be implemented in any programming language? Something like a meta language?

Along with their flowchart document, they published XML files that should achieve exactly that (they call it “XML pseudocode”). In my opinion, they didn’t quite succeed. Although, the file contains elements for abstract concepts, like branches, it still includes expressions that hide in regular text. Those expressions aren’t abstracted from the programming language. It’s quite clear that they were written with Java in mind.

<IF expr="STKL == 4">
  <THEN>
    <EVAL exec="SAP= BigDecimal.valueOf (36)"/>
    <EVAL exec="KFB= (ZKF.multiply (BigDecimal.valueOf (3624))).setScale (0, BigDecimal.ROUND_DOWN)"/>
  </THEN>
  <ELSE> <!-- ... --> </ELSE>
</IF>

In the end, I got curious and started to implement the algorithm straight from the flowchart. I chose Ruby, because there’s no Ruby implementation, yet. Also I’m quite fluent in it. It took me a whole evening to parse and implement the algorithm by following the flowchart boxes. After that I spent an additional evening to find the mistakes I made while implementing it. Maybe it would have been wiser to write a compiler for the XML format.

Nevertheless, I published the results as a Ruby gem: lohnsteuer. I hope this will be helpful for you.

Analyse app traffic with mitmproxy

2015-09-22T00:00:00+02:00

This post is part of a series on reverse engineering mobile apps.

Why reverse engineer mobile apps?
Analyse app traffic with mitmproxy

When you want to inspect an app’s behavior, the first step is to look at the traffic it produces. In this post I’ll show you how to do this with “mitmproxy”.

Fortunately, most apps encrypt their traffic nowadays. This means that you will have a hard time using a regular sniffer like tcpdump or Wireshark. Instead you have to do a man-in-the-middle attack to see what traffic the app produces. I’m using mitmproxy for this.

After you installed and started mitmproxy on your computer, you need to configure your phone to send its traffic through your computer.

On Android, this can be done by connecting both devices with the same Wi-Fi. Your computer needs to serve as a proxy server for your phone. This can be configured under “Settings → Wi-Fi”. Press long on the network you’re connected to, select “Advanced options” and put your computer’s IP address as “Proxy hostname” and the mitmproxy port as “Proxy port” (the default setting is 8080).

To setup mitmproxy for other platforms, have a look at the documentation.

Once your app sends its traffic through your computer, you need to get your phone to trust the mitmproxy certificate authority. This process has to be repeated every time you change the CA in mitmproxy. For Android, iOS and Windows phone, this is very easy. Just open the browser on your phone, visit mitm.it, select your platform and that’s it.

Now you can open the app you want to inspect and see the unencrypted traffic in mitmproxy on your computer.

If you don’t see traffic popping up or the app is not getting any data, this might have various reasons. Before assuming that the MITM attack failed, you should keep in mind that the data might be still cached on the device. For example, try “pull-to-refresh” or closing and reopening the app to force it to get fresh data.

It’s possible that the app doesn’t respect the proxy settings on the phone and thus escapes the MITM attack. This would be the case if the app still gets data and you don’t see activity in mitmproxy. You can use one of the other techniques to intercept the traffic, when you suspect that the setting isn’t respected.

Sometimes app developers configure their app not to trust the certificate authorities provided by the mobile phone. They bundle the correct certificate or certificate authority with their app. This technique is called Certificate Pinning. In those cases it’s not possible to inspect the traffic even if the traffic gets routed through your computer. The app simply rejects the fake certificate authority and refuses to connect to its servers. You can spot those cases when neither the app nor mitmproxy get any data. If we want to continue our analysis, we have to go one step further and replace the pinned certificate or certificate authority inside the app.

It’s also possible that the app isn’t using HTTP(S) to get its data. I came across such a situation when I tried to inspect the traffic of the Threema app.

Threema opens a regular socket and implements its own protocol. Prebuild MITM tools can’t handle those situations because they know nothing about the protocol. Even though the apps traffic reaches the app, it can’t MITM attack it. Once again, simple traffic analysis doesn’t help here.

Although, traffic analysis has it’s limitations I still recommend it as a first step. It’s simple to setup and a lot of apps can be analysed in a short amount of time.

In the next post I will show you how to learn more about an app by looking at it’s binary. This post will focus purely on Android apps.

Why reverse engineer mobile apps?

2015-09-07T00:00:00+02:00

This post is part of a series on reverse engineering mobile apps.

Why reverse engineer mobile apps?
Analyse app traffic with mitmproxy

Mobile apps are in demand - for several years now. Every business needs an app, even tiny ones. Sometimes I’m under the impression that building mobile apps became, what was in the early 2000s, building websites. “Hey, your business doesn’t have a website, therefore it isn’t future proof. Here, let me build (and sell) you one.” Replace the word “website” with “mobile app” and you have today’s situation.

Nevertheless this post isn’t to moan about the status quo. It’s about knowing what’s going on with your data. If you think about it, “mobile app” is just a fancy word for “software”. Software, that runs on your mobile phone. Software, that messes with your data. Data like your current location, your contacts, your photos, and so on. Most of the time you’ll have a pretty good understanding of what your data is being used for, but what if there are doubts? In the past, there were apps, that stole your address book, in order to provide you with “valuable insights and opportunities”. Do you want this?

The different mobile platforms offer different security models to manage the access to your data. On iOS, for example, the user is asked if she wants to give an app access to her address book. When a user installs an app on Android, he has to confirm, whether it gets access to his address book. There is no choice left. Either you give access, or you can’t install the app at all. No matter how an app got access to the data, how will a user know what happens with it next?

Image a local transport providers’ app, that gives you the directions to the homes of your friends. In order to simplify the process, the app uses the address book data to now where your friends live. It’s not necessary to upload the whole address book to build this feature. The user selects a friend’s address and the app sends this particular information to the server, in order to give directions. On the other hand, with the next update, the local transport provider could change their implementation and upload every address to their servers. Maybe they want to do this, to suggest that you visit your friends more often - automatically: “Hey, it’s time to visit Jane again. It’s only 15 minutes if you take the next subway”. Wouldn’t this be an innovative feature?

The point I want to make is that we don’t know when and how apps use our data. If the app is Open Source Software (or even Free Software), we can have a look at its source code. Unfortunately, it’s not that easy. How do you know that the source code was used to build the binary, you downloaded from the app store? Reproducible builds could help, but providing them isn’t a trivial task. Debian is working hard to make every binary package reproducible, but they’re still not there, yet. I’m not aware that there’s a similar project for mobile apps.

The source code of most commercial apps isn’t available anyway. In those cases it’s good to have a closer look at the app’s behavior from time to time. I feel it’s important to do so, to raise the companies awareness, make them fear bad press, and keep them from messing around with your data. The more people know about how to do this, the more pressure will be put on the app owners.

In the next post of this series, I’ll be looking at your options to find out what an app is doing.

tmux - my terminal window manager

2015-02-20T00:00:00+01:00

A dark full-screen terminal window. That is what you see mostly on my screen. My fascination with the terminal started when I found out how to run COMMAND.COM on my parents Windows computer. What a joy! I could edit files with “edit”, start “fdisk” and explore the file system with “DIR”. I felt like one of those cool hackers from the movies! A few years later I was lucky enough to discover that Windows and COMMAND.COM isn’t the most effective way to work and I started to learn about Linux and Bash.

Nowadays I’m still using terminal windows to get stuff done. Until recently it really where windows. Many windows. My favorite shortcut on my Linux machine was Ctrl-Alt-t - start a new terminal. At work I’m using a Mac. My muscle memory contains key combinations to create terminal windows and switch between them on both systems. It was okay. It never really bothered me - until a colleague introduced me to “tmux” - the terminal multiplexer.

The idea of tmux is really simple. Instead of having multiple windows with one shell prompt each, why not have a single terminal window with many shell prompts. tmux saves me from opening a lot of terminal windows. But wait, there’s more!

You can split windows (horizontally and vertically), scroll and search through the terminal output, copy-and-paste from inside tmux, manage groups of windows and use it for remote pair programming. Interested? Then let’s get started with the vocabulary.

Vocabulary

Windows, panes and sessions. Those are the building blocks. When I started to learn about tmux it took me some time to get the meaning. What helped me was to map the tmux concepts to those of my beloved terminal programs iTerm and gnome-terminal.

You can think of tmux as the equivalent of your terminal program ( iTerm, gnome-terminal). A tab in your terminal program is called “window” in tmux. When you would open another window inside your terminal program, you start a new “session” in tmux. Splitting one tab of your terminal window tabs into different parts is what tmux calls splitting a window into “panes”. Unfortunately you can’t do that with gnome-terminal, so it’s not included on the screenshot.

The prefix key

The first thing that my colleague recommended me to configure was the “prefix key”. It’s the key combination that tmux intercepts to recognize its commands. By default the prefix key is Ctrl-b. This combination is also used in Emacs to move one character backward so I ended up changing the prefix key to Ctrl-]. It’s easy to type on an English keyboard and not bound in any of the programs I use. In most posts about tmux you’ll see that people reference the prefix key as prefix instead of assuming that it’s Ctrl-b. I’ll do the same and write for example prefix c instead of Ctrl-b c (default prefix key) or Ctrl-] c (my custom prefix key).

If you haven’t done already it’s now time to start tmux and follow along. Reading about tmux is not as much fun as reading about tmux and immediately trying it. Just open your regular terminal program, install tmux and start a new session with tmux new.

Windows

Type prefix c to create a new window. At the bottom of the screen you see the “status line”. An asterisks next to the window name shows the active window. It should currently be the second one, because you just created a new window.

tmux automatically uses the name of the current running program as the name of the window. If you’re running a Bash shell, your window will be named “bash”. When you run a program inside that Bash (like “man tmux”), the name of the window will change to “man”. If you don’t want your window to automatically change its name, press prefix , to assign a name. Try it by naming your newly created window “demo”.

tmux starts to count its windows from “0” on. To switch back to the first window, type prefix 0. You can also used prefix n and prefix p to jump to the next and previous window. That enough with windows for now, let’s focus on panes. Close your “demo” window (prefix 1) by ending the shell process (exit or Ctrl-d).

Panes

To split your current window in two horizontal panes type prefix ". You can move to the next pane with prefix o. A very cool feature is to directly jump to a pane by typing prefix q (you’ll see numbers appearing in the panes) and then press the number of the pane you want to jump to (prefix q 0 for example). That feature blew my mind when I discovered it.

If you want to focus on one pane you can zoom it with prefix z. In the status bar you’ll see a “Z” appearing next to the window name. Press the zoom key again to shrink the pane back to its original size.

Vertically panes can be created with prefix %. Closing a pane requires you to end its process (for example the shell). If for any reason that process gets stuck (for example a dead ssh connection) use prefix x to kill that pane. Don’t worry about accidentally killing a pane - you’ll be asked to confirm before a pane gets killed. If you only have one pane, killing the pane means also killing the window.

Sessions

Remember: Opening a new session is like opening a new terminal window. Unfortunately there is no default key combination to create a new session. You can get around it by going to the tmux command prompt (prefix :) and executing the new-session command.

The first thing that you want to do with your new session is to give it a name. Having a name for your session will make it much easier to differentiate between different ones. Press prefix $ to rename your current session.

Switching between sessions is easy: prefix s shows a list of session. You can use the arrow keys, Ctrl-n and Ctrl-p (Emacs keys) or the number of the session to jump to it. When you have more than two session in that list you’ll be glad that you assigned names.

To close a session you have to close all of its windows. You’ll exit tmux when you close your current session. Don’t worry though, the other session are still running in tmux. With tmux attach you can return to them. Speaking of attaching: Whenever you feel the need to exit tmux, you can press prefix d to detach tmux from your terminal window.

How I’m using tmux

The session where I’m currently writing this post is named “blog”. I started two windows in that session: “edit” and “man”. The edit window contains two horizontally split panes. One pane running Emacs and the other pane running Jekyll. In the “man” window I’ve opened the tmux manpage.

I’m using sessions to arrange the different kind of things that I do. When I write code I’m created a new session for that project I’m working on. I use windows and panes for the compile output, manpages etc. The first window runs an emacsclient in most of my sessions.

Protips

Change the prefix key to something that fits your needs. If you’re using vim, keeping Ctrl-b should be fine.
Force yourself to learn the tmux key combinations. Stop using your terminal window shortcuts. Press prefix ? to get all available key combinations.
Change the base index for panes and windows to “1”. You can then press prefix 1 to get to the first window and prefix q 1 to get to the first pane. This is more naturally to me given that the numbers on my keyboard start from “1” on.
Use a tmux theme. I started by forking the “neverland” colorscheme by Lowe Thiderman.
Familiarize yourself with the copy-mode. Being able to copy and paste everything that appears in tmux is a killer feature.
Use the copy-mode also to scroll and search your current pane.

JSON Schema talk

2014-08-27T00:00:00+02:00

Some time ago Jens and I started the WebAPI Hamburg Meetup. At the last meetup I did a lightning talk on JSON Schema.

It’s a rather basic talk with the intention of motivating people to look into JSON Schema for their APIs. You can have a look at the slides below:

If you are on a small screen, you might want to open the slides in a new tab.

I created the slides using remark, which I can highly recommend. You can write your slides using Markdown and that’s really great.