YAML vs JSON Config Files

01 Apr 2025 · #system-design

Confession: I'm cheating on my first love.

JSON is great. She's always there for me. I tell her my deepest secrets, and she keeps them safe. She's dependable and helps me talk with others.

But my recent fling with YAML has me wondering... Is there someone better out there? Someone who just gets me better than JSON does?

Lately, I've fallen in love with YAML. Yes, she makes my config files more readable. But she does so much more than that.

What are you talking about, weirdo?

All apps have configuration. This is typically a file that contains the app settings, letting you make tweaks without digging into the code base.

Often, config files are written in JSON and look like this:
{
  "app": {
    "name": "DefenseAgainstTheDarkArts",
    "version": "0.0.1",
    "debug": false,
    "secret_key": "ssshhh-my-api-key-alohomora"
  },
  "server": {
    "host": "0.0.0.0",
    "port": 8080
  },
  "database": {
    "engine": "postgresql",
    "host": "db.hogwarts.edu",
    "port": 5432,
    "username": "dobby",
    "password": "sssshhh-secret-database-password",
    "name": "public",
    "pool_size": 10
  },
  "logging": {
    "level": "info",
    "format": "json",
    "output": "logs/app.log"
  }
}
But lately, I'm flirting with config files that look like this:
app:
  name: DefenseAgainstTheDarkArts
  version: 0.0.1
  debug: false
  secret_key: ssshhh-my-api-key-alohomora
server:
  host: 0.0.0.0
  port: 8080
database:
  engine: postgresql
  host: db.hogwarts.edu
  port: 5432
  username: dobby
  password: sssshhh-secret-database-password
  name: public
  pool_size: 10
logging:
  level: info
  format: json
  output: logs/app.log

Both config files have the same content. The app can read either file to see the server it's running on, how to access the backend database, and where to send the logs.

But these two beauties look different. Today we'll see the differences and why YAML is my new "main squeeze."

Cleaner Format

Readability is important. Code is written for machines, but we feeble humans need to read that code, too.

Look again at the two examples above. Notice what's missing from the YAML version?

  • No quotes
  • No braces ({})
  • No commas

YAML's already looking easy on the eyes. 😍

While JSON uses commas and braces to define structure, YAML's structure is built upon whitespace and indentation. It's subtle, but without these extra characters, I can read the file more quickly. And I don't have to worry about forgetting a comma or an unclosed brace when editing the file.

Quick side note: Both JSON and YAML are built on the concept of key-value pairs.

  • Keys usually name things you want to track. Keys appear before the colon (:). (See green-bold items in examples above)
  • Each key has a value which appears after the colon (:). A value can be a string, number, boolean (true/false), list, or even another collection of key-value pairs.

Okay, back to my raving: One of my favorite features of YAML is how she handles long strings. With JSON, you're stuck scrolling...

{
  "spells": {
    "patronus_charm": {
      "incantation": "Expecto Patronum",
      "description": "The Patronus Charm is an advanced defensive spell that conjures a positive energy force, often taking the form of an animal that represents the caster's spirit. It is primarily used to repel Dementors, creatures that feed on human happiness. Only those who can summon deeply positive memories can successfully cast a Patronus.\n"
    }
  }
}

But with YAML, you can break long, single-line strings across multiple lines with the fold operator (>):

# file: config_longstring.yaml
spells:
  patronus_charm:
    incantation: "Expecto Patronum"
    description: >                 # fold line-breaks into one line with `>`
      The Patronus Charm is an advanced defensive spell that conjures a
      positive energy force, often taking the form of an animal that
      represents the caster's spirit. It is primarily used to repel
      Dementors, creatures that feed on human happiness.
      Only those who can summon deeply positive memories can successfully
      cast a Patronus.

This gives you the best of both worlds:

  1. In your YAML file, you split the string among multiple lines to make the text more readable (no horizontal scrolling).
  2. When your app reads the file, it replaces the line breaks with whitespace, recreating the long, single-line string for use in the app logic.

Here's how python would interpret the file:

import yaml
with open(f"./config_longstring.yaml", "r") as f:
  config_yaml = yaml.safe_load(f)

# config file converted to python dictionary `config_yaml`: 
# line breaks replaced with whitespace
{
  'spells': {
    'patronus_charm': {
      'incantation': 'Expecto Patronum', 
      'description': "The Patronus Charm is an advanced defensive spell that conjures a positive energy force, often taking the form of an animal that represents the caster's spirit. It is primarily used to repel Dementors, creatures that feed on human happiness. Only those who can summon deeply positive memories can successfully cast a Patronus.\n"
    }
  }
}

BUT if you want to keep line breaks, you can use the pipe operator (|):

# file: config_longstring.yaml (updated)
spells:
  patronus_charm:
    incantation: "Expecto Patronum"
    description: |                 # preserve line breaks with `|`
      The Patronus Charm is an advanced defensive spell that conjures a
      positive energy force, often taking the form of an animal that
      represents the caster's spirit. It is primarily used to repel
      Dementors, creatures that feed on human happiness.
      Only those who can summon deeply positive memories can successfully
      cast a Patronus.

The YAML parser will read the string block literally and respect line breaks. Note how line breaks are identified by the new line character (\n) when the file loads again:

# config file converted to python dictionary: 
# line breaks preserved (\n)
{
  'spells': {
    'patronus_charm': {
      'incantation': 'Expecto Patronum',
      'description': "The Patronus Charm is an advanced defensive spell that conjures a\npositive energy force, often taking the form of an animal that\nrepresents the caster's spirit. It is primarily used to repel\nDementors, creatures that feed on human happiness.\nOnly those who can summon deeply positive memories can successfully\ncast a Patronus.\n"
    }
  }
}

This gives you options. You have more control over how the file is arranged for human readability and how the machine will interpret the file.

Comments

Want to use comments in your JSON config?

Too bad. You can't.

But you can with YAML. Just use the hash symbol (#):

app:
  name: DefenseAgainstTheDarkArts
  version: 0.0.1
  environment: development  # options: development, staging, production
  debug: true               # WARNING: set this to false before deploying
  secret_key: ssshhh-my-api-key-alohomora
server:
  host: 0.0.0.0
  port: 8080

Comments reveal a setting's possible values and draw attention to important sections. They share the intent and design of the config, which is like gold to other developers (and you 6 months from now).

Unfortunately, the JSON specification doesn't allow comments. In YAML, you can use comments to record what you were thinking when writing the file.

Re-Use Code

Do you like repeating yourself?

If so, keep using JSON:
{
  "services": {
    "api": {
      "host": "0.0.0.0",
      "port": 8080,
      "logging": {
        "level": "info",
        "format": "json",
        "output": "logs/app.log"
      }
    },
    "worker": {
      "concurrency": 4,
      "logging": {
        "level": "info",
        "format": "json",
        "output": "logs/app.log"
      }
    }
  },
  "databases": {
    "primary": {
      "engine": "postgresql",
      "host": "db.hogwarts.edu",
      "port": 5432,
      "username": "dobby",
      "password": "sssshhh-secret-database-password",
      "pool_size": 10,
      "name": "primary_db"
    },
    "replica": {
      "engine": "postgresql",
      "host": "db.hogwarts.edu",
      "port": 5432,
      "username": "dobby",
      "password": "sssshhh-secret-database-password",
      "pool_size": 10,
      "name": "replica_db",
      "read_only": true
    }
  }
}
If not, use YAML:
services:
  api:
    host: 0.0.0.0
    port: 8080
    logging: &default_logging # define logging node as anchor
      level: info
      format: json
      output: logs/app.log
  worker:
    concurrency: 4
    logging: *default_logging  # reuse logging settings

default_database: &default_db
  engine: postgresql
  host: db.hogwarts.edu
  port: 5432
  username: dobby
  password: sssshhh-secret-database-password
  pool_size: 10
databases:
  primary: 
    <<: *default_db  # reuse default_db config
    name: primary_db
  replica:
    <<: *default_db  # reuse default_db config
    name: replica_db
    read_only: true

YAML lets you reuse a node, or part of the config file. You first "anchor" the node and give it an alias with the & character. Later you reference that alias with the * character. The example above shows how to reuse log settings across services, enabling consistency and the DRY principle.

You can even merge an aliased node into another node that has extra key-value pairs. Check out our database config in JSON above. The primary and replica databases have many identical settings; only name and read_only differ. In the YAML version, common settings are in the default_database node, aliased as (&default_db). default_db is merged into the primary and replica settings with the << operator; right after that, the settings unique to each environment are defined (name and read_only).

YAML's anchors reduce future bugs. Let's face it: We've all introduced bugs when we updated the password for the primary database but forgot to update the password for the replica. With the YAML pattern above, we can safely update our config in one place and sleep better at night.


I can go on and on about my love, but I think you get it.

YAML is relatively new but has rapidly grown in popularity. Here's where you can find her in the wild:

JSON is a long-time industry standard and used to transmit data between machines. I'll still use JSON for my projects. But where possible, I'm going to flirt with YAML for a more human touch.

If you're a nerd like me, read the YAML spec (i.e. the boring manual) at yaml.org.

And if you want training on software design, call me. Like YAML, you'll find me... approachable.