Useful Resources
During consultations, workshops, and training, research papers, websites, and tools are often mentioned. This curated index compiles those references so that they are available to others.
AI/ML
-
3Blue1Brown: Neural Networks
Visual, intuitive explanations of neural network fundamentals
-
CS231n: Convolutional Neural Networks for Visual Recognition
Stanford’s in-depth course notes on neural networks, activations, and backpropagation
-
DeepLearning.AI
Andrew Ng’s platform for practical AI/ML courses and specialization tracks
-
Distill
Interactive, visually rich articles explaining ML concepts in depth
-
fast.ai
Practical deep learning courses and libraries for quickly building models
-
Google Colab
Free cloud-based Jupyter notebooks with GPU/TPU support
-
Hugging Face
Community-driven hub for pretrained models, datasets, and tools for NLP and beyond
-
Jupyter Notebook
The de facto standard for web-based interactive development in data science
-
Kaggle
Datasets, competitions, and GPU compute for notebooks
-
KDnuggets
News, tutorials, and curated content on data science and machine learning
-
ML Cheatsheets
Concise reference sheets for key machine learning algorithms and math
-
Papers with Code
Browse machine learning research papers alongside their implementations and benchmarks
-
PyTorch
Popular deep learning framework for building and experimenting with neural networks
-
TensorFlow Playground
Interactive visualizer for experimenting with perceptrons, activation functions, and multi-layer networks
AWS Tools
-
AWS CLI
The official CLI for AWS.
-
AWS CloudShell
A browser-based shell for managing AWS resources without local setup
-
AWS SDKs
Official software development kits for integrating AWS services in various programming languages
-
AWS Workshops
Hands-on labs and guided workshops for learning AWS services through real-world scenarios
-
aws-nuke
Open-source tool for removing all resources from an AWS account quickly and safely
-
Cloudsplaining
Analyzes AWS IAM policies for security risks and over-permissive roles
-
eksctl
Open-source CLI for creating and managing Kubernetes clusters on AWS EKS
-
LocalStack
A free, open-source emulator for AWS services to develop and test locally
-
Prowler
Open-source security tool for auditing AWS accounts using CIS benchmarks and best practices
-
saml2aws
Command-line utility for authenticating to AWS with SAML-based Single Sign-On
Blog Posts and Articles
-
AI and the Alignment Problem
Collection of essays exploring long-term AI safety and alignment challenges.
-
Amazon Face Recognition Ban
Bias and ethics in real-world AI deployments.
-
Dead Cat Syndrome
Classic blog post on the transition from project delivery to business-as-usual (BAU) support.
-
GraphQL Primer Part 2
Introduction to GraphQL for REST-experienced engineers.
-
Martin Fowler: Microservices
Definitive primer on the principles and trade-offs of microservices architecture.
-
Site Reliability Engineering: Google’s Approach
Google's free-to-read book introducing SRE principles and practices.
-
The Cathedral and the Bazaar
Eric Raymond’s classic essay on open-source development methodologies.
Books
-
Accelerate
Research-driven insights into software delivery performance and DevOps capabilities.
-
AI in Museums
How AI/ML is used in museums to enhance visitor experiences.
-
Continuous Delivery
Jez Humble and David Farley's foundational text on CI/CD practices for reliable software delivery.
-
Deep Learning (Goodfellow, Bengio, Courville)
Comprehensive introduction to deep learning theory and practice, available free online.
-
Designing Data-Intensive Applications
In-depth exploration of building scalable, reliable data systems (paid, with free chapters online).
-
Machine Learning Yearning
Andrew Ng's practical guide to structuring and improving machine learning projects.
-
Site Reliability Engineering
Google’s complete SRE handbook, freely available online.
-
The DevOps Handbook
Comprehensive guide to implementing DevOps principles across the software delivery lifecycle.
-
The Phoenix Project
A novel about DevOps transformation and the three ways of flow, feedback, and learning.
C# Ecosystem
-
.NET (Core / 5+)
The cross-platform, open-source framework for building modern C# applications.
-
ASP.NET Core
Framework for building web apps, APIs, and microservices using C# and .NET.
-
AutoMapper
Convention-based object-to-object mapper for reducing boilerplate in data transfer.
-
ConnectWise Automate SDK
SDK for extending the ConnectWise Automate Remote Monitoring and Management (RMM) application via plugins.
-
Entity Framework Core
Object-relational mapper (ORM) for C# applications, simplifying database interactions.
-
Polly
Resilience and transient fault-handling library for .NET (retries, circuit breakers, etc.).
-
Serilog
Structured logging library for C# with powerful sinks and configuration options.
-
xUnit.net
Popular open-source testing framework for C# applications.
CI/CD Platforms
-
Argo Workflows
Kubernetes-native CI/CD and workflow engine for running complex pipelines.
-
Azure DevOps Pipelines
CI/CD tools from Microsoft Azure.
-
CircleCI
Cloud-based CI/CD platform with strong Docker and Kubernetes support.
-
GitHub Actions
CI/CD automation on GitHub.
-
GitLab CI
GitLab's integrated CI/CD pipeline system.
-
Spinnaker
Open-source, multi-cloud continuous delivery platform.
-
Tekton
Open-source framework for building Kubernetes-native CI/CD pipelines.
-
Travis CI
Hosted CI/CD service for building and testing software projects integrated with GitHub.
Containerisation
-
Anchore Engine
Policy-based analysis and scanning of container images for vulnerabilities and compliance.
-
Clair
Static vulnerability analysis tool for container images used in registries like Quay.
-
containerd
Industry-standard container runtime used in Docker and Kubernetes environments.
-
CRI-O
Lightweight container runtime for Kubernetes workloads.
-
Docker
The most popular container platform for building, shipping, and running applications.
-
Docker Hub
The go-to registry for open-source container images
-
Helm
Package manager for Kubernetes applications, simplifying deployment and updates.
-
Kind (Kubernetes in Docker)
Lightweight Kubernetes clusters for testing using Docker containers.
-
Kubernetes
The de facto standard for container orchestration across clusters.
-
Minikube
Tool for running a local single-node Kubernetes cluster for development and testing.
-
Podman
Daemonless, open-source container engine with strong Kubernetes compatibility.
-
Trivy
Open-source vulnerability scanner for container images, file systems, and IaC.
Documentation Tools
-
BookStack
Open-source, self-hosted platform for organizing documentation as books with chapters and pages.
-
Confluence (Free Tier)
Team-oriented wiki/documentation platform with strong integrations (offers a free plan for small teams).
-
Docusaurus
Open-source static site generator from Meta (formerly Facebook), optimized for project documentation.
-
GitBook (Free for Open Source)
Cloud-based documentation platform with collaborative editing and GitHub sync.
-
Mermaid
Markdown-like syntax for generating diagrams and flowcharts in docs.
-
mike
Version manager for MkDocs sites.
-
MkDocs
Static site generator using Markdown.
-
Notion (Free for Personal Use)
Collaborative workspace for notes, docs, and lightweight knowledge bases with templates and integrations.
-
PlantUML
Text-based UML diagramming tool for embedding diagrams directly in documentation.
-
Read the Docs
Free hosting for Sphinx and MkDocs projects with automatic builds and versioning.
-
Redoc
Clean, responsive API documentation from OpenAPI definitions.
-
Sphinx
Python-based documentation generator, widely used in scientific and software projects.
-
Swagger UI / OpenAPI
Interactive API documentation generator from OpenAPI specs.
-
Wiki.js
Modern, open-source wiki software with Git-backed content storage and Markdown support.
Infrastructure Management
-
Ansible
Configuration management for deployed infrastructure
-
ArgoCD
Declarative GitOps continuous delivery for Kubernetes.
-
Conftest
Policy testing tool for validating configuration files using Open Policy Agent.
-
Consul
Service discovery, configuration, and segmentation for dynamic infrastructure.
-
Crossplane
Kubernetes-based control plane for managing cloud and on-prem infrastructure via CRDs.
-
Infracost
Cost estimation tool for Terraform, showing cloud cost impacts before deployment.
-
k9s
Terminal UI for managing Kubernetes clusters
-
Kubernetes (K8s)
Orchestration of containers at scale
-
Nomad
Lightweight orchestrator for containers, VMs, and batch workloads.
-
Open Policy Agent (OPA)
Policy-as-code framework for enforcing governance across infrastructure and applications.
-
OpenTofu
Community-driven fork of Terraform providing open governance for IaC.
-
Packer
Automates building machine images for multiple platforms from a single source configuration.
-
Pulumi
Infrastructure-as-code using familiar programming languages like Python, TypeScript, and Go.
-
Terraform
Infrastructure-as-code tool that provisions infrastructure from declarative configuration
-
Vagrant
Management of development environments via virtual machines
-
Vault
Secrets management, encryption, and identity-based access for infrastructure.
Platform Engineering
-
Backstage
Open-source framework for building developer portals.
-
Humanitec
Platform Orchestrator for building internal developer platforms with dynamic environment management.
-
Keptn
Control plane for automating delivery and operations workflows in a platform context.
-
KrakenD
Open-source API gateway useful for building internal platforms with aggregated services.
-
Port
Self-service platform for defining and operating developer portals and golden paths.
-
Score
Specification for simplifying application workload definitions across environments.
Python Ecosystem
-
bandit
Static security analysis tool for Python codebases.
-
boto3
AWS SDK for Python.
-
Celery
Distributed task queue for asynchronous processing in Python.
-
coverage
Code coverage measurement tool.
-
Django
Full-featured Python web framework for rapid development.
-
FastAPI
High-performance web framework for APIs.
-
FastStream
Modern Python framework for building async message-driven applications with Kafka, RabbitMQ, and NATS.
-
Flask
Lightweight web framework for Python.
-
Invoke
Task execution tool for Python, similar to Makefiles but in Python.
-
Jinja
Templating engine for generating documents or configuration.
-
mypy
Static type checker for Python, helping enforce type safety in codebases.
-
pandas
Data analysis and manipulation library.
-
peewee
Small, expressive ORM for Python.
-
Plotly
Interactive graphing library for Python with web-ready visualizations.
-
Poetry
Dependency management and packaging tool for Python projects.
-
pylint
Static code analysis and style enforcement.
-
PySpark
Python API for Apache Spark for large-scale data processing.
-
pytest
Widely used third-party testing framework for Python (the standard library's own module is unittest).
-
Python StateMachine
Easy-to-use state machine library for Python.
-
pyvmomi
VMware vSphere Management SDK.
-
Quart
Flask-compatible async web framework supporting WebSockets and HTTP/2.
-
Requests
Simplified HTTP requests in Python.
-
scikit-learn
Machine learning library with algorithms for classification, regression, and clustering.
-
seaborn
Data visualization library built on matplotlib.
-
SQLAlchemy
Advanced ORM and SQL toolkit.
-
Starlette
Lightweight ASGI framework for building high-performance async web services.
-
tensorflow
End-to-end machine learning framework.
-
Typer
Library for building CLI apps in Python from type hints, with automatic help generation.
Research Papers
-
Aerial Imagery Tree Classification
Applies deep learning to satellite imagery for classification.
-
Attention is All You Need
Introduced the transformer architecture, which is foundational to modern LLMs.
-
Big Ball of Mud
Explores the chaotic architecture style still prevalent in the industry.
-
Brewer's CAP Theorem (2000)
Foundational paper on the trade-off between consistency, availability, and partition tolerance in distributed systems.
-
CNN + LSTM for Sentiment Analysis
Arabic sentiment analysis using a hybrid CNN and LSTM model.
-
Conway's Law (1968)
Original articulation of the principle that system design mirrors organizational communication structures.
-
Distilling the Knowledge in a Neural Network (2015)
Hinton et al. introducing knowledge distillation for transferring knowledge between models.
-
Efficient BackProp (1998)
Yann LeCun’s guidelines for improving backpropagation training efficiency, including insights on activation functions
-
Ethical Principles in ML/AI
Discusses trade-offs between interpretability and power in ML.
-
Explainable Deep Learning
Methods for making DNNs explainable in real-world applications.
-
Explaining and Harnessing Adversarial Examples (2015)
Goodfellow et al. explaining why adversarial examples arise and introducing the fast gradient sign method for generating them.
-
Fast and Accurate Deep Network Learning by Exponential Linear Units (2015)
Clevert et al. proposing ELU activation for faster, more stable deep network convergence
-
Identifying Technical Debt
Compares approaches to identifying technical debt in real projects.
-
ImageNet Classification with Deep Convolutional Neural Networks (2012)
Krizhevsky, Sutskever, and Hinton’s AlexNet paper demonstrating the power of deep convolutional networks
-
Improving Code Maintainability
Impact of refactoring on long-term code health and maintainability.
-
Learning Representations by Back-propagating Errors (1986)
Rumelhart, Hinton, and Williams’ seminal paper popularizing backpropagation for training multi-layer networks
-
Mastering the Game of Go with Deep Neural Networks and Tree Search (2016)
AlphaGo paper combining deep learning with Monte Carlo tree search to beat human champions at Go.
-
Mish: A Self Regularized Non-Monotonic Neural Activation Function (2019)
Diganta Misra introducing the Mish activation function for improved gradient flow and performance
-
On the Dangers of Stochastic Parrots
Risks of large black-box models in NLP and their societal impact.
-
Perceptrons (1969)
Minsky and Papert’s book formalizing perceptron limitations and influencing the development of multi-layer networks
-
Playing Atari with Deep Reinforcement Learning (2013)
DeepMind’s introduction of deep Q-networks (DQNs), applying deep learning to reinforcement learning problems.
-
Rectified Linear Units Improve Restricted Boltzmann Machines (2010)
Nair and Hinton’s paper introducing ReLU activation for improving deep learning performance
-
Rectifier Nonlinearities Improve Neural Network Acoustic Models (2013)
Maas et al. introducing Leaky ReLU for mitigating dead neurons in rectified activations
-
Retrieval-Augmented Generation
RAG architecture for external knowledge retrieval in LLMs.
-
Software Aging
Discusses software aging and strategies to mitigate its effects.
-
The Lottery Ticket Hypothesis (2019)
Frankle and Carbin’s influential work on network pruning and the existence of small trainable subnetworks.
-
The Perceptron: A Probabilistic Model for Information Storage and Organization in the Brain (1958)
Frank Rosenblatt’s original paper introducing the perceptron — a foundational neural network model
Software Deployment
-
Argo Rollouts
Kubernetes controller for progressive delivery (blue/green, canary, and A/B testing).
-
Capistrano
Remote server automation and deployment tool, widely used for Rails and other apps.
-
Flagr
Open-source feature flagging and A/B testing service for controlled releases.
-
FluxCD
GitOps-based deployment tool for Kubernetes clusters.
-
Kustomize
Template-free way to customize Kubernetes application configurations.
-
LaunchDarkly
Feature management platform for progressive rollouts and experimentation.
-
Octopus Deploy (Community Edition)
Deployment automation platform with support for multi-environment and multi-cloud delivery.
-
Serverless Framework
Framework for deploying serverless applications across cloud providers.
Source Control Platforms
-
Azure Repos
Microsoft's source control service with Git and TFVC support, part of Azure DevOps.
-
Bitbucket
Source control platform from Atlassian.
-
Gerrit Code Review
Web-based code review tool tightly integrated with Git, popular in large-scale projects.
-
Gitea
Lightweight, self-hosted Git service with issue tracking and CI integration.
-
GitHub
The world's most popular code hosting platform.
-
GitLab
DevOps and source control platform.
-
SourceHut
Minimalist, open-source platform for Git hosting, CI, and mailing lists.
Standards and Best Practices
-
Architecture Decision Records (ADRs)
Lightweight documentation practice for capturing architecture decisions.
-
CNCF Cloud Native Security Whitepaper
Guidance on securing cloud-native applications and infrastructure.
-
GDPR (General Data Protection Regulation)
EU regulation on data protection and privacy for individuals.
-
GitHub Well-Architected Framework
GitHub-specific SDLC best practices.
-
Google SRE Principles
Foundational principles and practices for operating reliable distributed systems.
-
ISO/IEC 27001
International standard for information security management systems.
-
ISO/IEC/IEEE 26515:2018
International standard for developing user documentation in agile environments.
-
Monitoring Distributed Systems
Google's view on monitoring and the four golden signals.
-
NIST Secure Software Development Framework (SSDF)
NIST’s guidance for integrating security practices across the software development lifecycle.
-
OWASP Top Ten
Standardized list of the most critical web application security risks.
-
PCI DSS
Security standards for organizations that handle credit card information.
-
The Good Docs Project
Open-source documentation templates.
-
The Twelve-Factor App
Best practices for building scalable, maintainable SaaS applications.
Terraform Ecosystem
-
Atlantis
Self-hosted pull request automation for Terraform plans and applies.
-
Checkov
Open-source static analysis tool for detecting misconfigurations in Terraform, Kubernetes, and cloud IaC.
-
infracost
Estimate Terraform resource costs via static analysis.
-
OPA (Open Policy Agent) with Terraform
Policy-as-code enforcement for Terraform configurations.
-
Pre-Commit Terraform Hooks
Pre-commit hooks for formatting, validation, and security scanning of Terraform code.
-
Terraform Module Registry
Official registry of reusable Terraform modules for AWS, Azure, GCP, and other platforms.
-
Terragrunt
Thin wrapper for Terraform providing DRY configurations, remote state management, and multi-environment support.
-
tfenv
Terraform version manager.
-
tflint
Terraform linter.
-
tfsec
Static analysis tool to detect security issues in Terraform configurations.
Testing Frameworks and Software
-
Behave!
A behavioural testing framework built in Python, ideal for end-to-end infrastructure and ETL testing.
-
Cypress
End-to-end testing framework for modern web applications with real-time debugging.
-
Grafana k6
Load and performance testing tool with tests scripted in JavaScript/TypeScript
-
Hypothesis
Property-based testing framework for Python, generating dynamic test cases.
-
Jest
Popular JavaScript testing framework with built-in mocking and snapshot testing.
-
JUnit
The standard unit testing framework for Java applications.
-
Lighthouse
Analyzes webpage performance, accessibility, best practices and SEO
-
OWASP ZAP
Open-source security testing tool for web applications, focusing on vulnerability scanning.
-
Postman
Collaborative API testing and monitoring tool with scripting capabilities.
-
pytest
Feature-rich testing framework for Python, supporting fixtures and plugins.
-
Selenium
Framework for automating web browsers, widely used for UI testing.
-
Terratest
A mature Go framework for writing infrastructure tests that integrates with Terraform pipelines.