Hand-Rolling My Own Static Site Generator
The personal blog you are reading was initially built with the Hugo Static Site Generator(SSG). Last year, when I started this personal blog, that was the right choice as it allowed me to focus on writing and organising my posts. I like having maximum control over my tools, and Hugo has since gotten in my way a few times. I also needed to learn how to use Hugo to get the most from it. I believed that for my simple needs, I needed a lightweight alternative that allowed me to bend it to my will as much as I liked. This is the motivation for writing my static site generator.
This post will perform a bird's-eye view of the implementation and functionalities of the static site generator(referred to as SSG for the rest of the post) I wrote to satisfy the need outlined in the previous paragraph.
An SSG is a simple project. The kind you knock off in an evening using tools you are familiar with. I did just that with this project. This SSG is written in Python, a language I use heavily at work and play.
Dependencies
This project relies on the following Python libraries:
- markdown: converts markdown files to HTML
- pyyaml: parses YAML front matter in blog posts
- jinja2: for HTML templating
- pygments: for code syntax highlighting
- feedgen: generates RSS feeds
- watchdog: enables live reloading during development
- pytest: the testing framework
With these libraries in place, the remaining effort is to write the code that orchestrates how markdown files are converted to HTML files, along with other blogging convenience features like enforcing a tags system, RSS feeds, sitemaps and code syntax highlighting.
Generating HTML from markdown
I write my blog posts in markdown. It's a convenient file format. The main task the SSG would be performing is to convert these markdown files into HTML files for the browser. I created three classes for organising the data: Post
, Tag
and Pages
. They are container classes and allow me to pass data around conveniently.
A blog post(like this one you're reading) starts its life as a markdown file in the posts
folder. All the markdown files in the posts
folder get read and marshalled into the Post
class. From the Post
classes created from all the markdown files, tags are extracted and marshalled into Tag
classes. The Pages
class organises tags and posts for the index page. The flow of this process can be seen in the main
build function:
def main():
"""The build function.
Uses all the right classes and utilities to perform markdown to HTML conversion.
Also generates the CSS, RSS feeds and sitemap.xml files.
"""
# Get project root directory
project_root = Path(__file__).parent.parent
# Setup paths
posts_dir = project_root / "blog" / "posts"
static_dir = project_root / "static"
templates_dir = project_root / "blog" / "templates"
css_dir = project_root / "static" / "css"
# Cleanup previously generated files
cleanup_generated_files(static_dir)
# Generate CSS files
try:
css_generator = CssGenerator(str(css_dir))
css_generator.generate_pygments_css()
except Exception as e:
print(f"Warning: Failed to generate CSS: {e}")
try:
# Build pages from markdown files
builder = PageBuilder(str(posts_dir))
pages = builder.build_pages()
# Write all pages
writer = PageWriter(
output_dir=str(static_dir / "posts"),
templates_dir=str(templates_dir),
site_url="https://www.oyeoloyede.com"
)
successful_count = writer.write_all(pages)
# Print final summary
print(f"\nTotal posts generated: {successful_count}")
return 0 if successful_count > 0 else 1
except Exception as e:
print(f"\nError during processing: {str(e)}")
return 1
The flow of this function is as follows:
- Initialize the relevant directories.
- Cleanup the files generated from the last build with the call to
cleanup_generated_files
- Create CSS files using the
CssGenerator
class. - Build the posts from markdown files. The
PageBuilder
class uses the markdown library to convert the markdown files into HTML strings. Tags and post metadata(title, date, etc) are also embedded into the markdown files as YAML front matter so the pyyaml library parses this front matter to a Python dict.PageBuilder
delegates this coordination of extracting metadata and post content from the markdown file to aMarkdownConverter
class. - The pages produced by
PageBuilder
are passed toPageWriter
which, as you would expect, writes out all the blog content, including RSS feeds and sitemap files.PageWriter
delegates building sitemaps and RSS feeds to theSitemapGenerator
andBlogFeedGenerator
classes respectively. jinja templates are used for writing out the HTML files to enable as much reuse as possible, and there is a dedicatedTemplateHandler
class thatPageWriter
employs for handling template-related tasks. - Report the result of the build.
Styling
A style.css
file is maintained in the /static/css
folder. This file provides the styling for the entire blog. This is the only hand-written file in the static
folder and sub-folders.
Code syntax highlighting is provided by the codehilite markdown extension and a styling sheet is generated by the pygments library.
Development live server
Using the watchdog library, a development live server is set up. This also includes live reloading and building the posts when a change is made to any markdown file. This makes for a pleasant writing experience as I can preview my content as I write.
Conclusion
My requirement of having a minimalist SSG capable of providing my blogging needs has been satisfied with this project. It is also a fun little distraction for an evening. In less than the time it would have taken me to get acclimatized with the documentation of any of the free, popular open-source static site generators, I have spun up a tiny SSG that meets my needs. This is also something that can be hacked and remoulded to meet any future needs. I suspect there will be more SSG hand-rolling in my future :)
You can find the source code of the SSG described in this post on my GitHub.