SISTRIX GmbH Logo

SISTRIX GmbH

Bonn

Software Craftsman & DevOp - SEO

Software Craftsman | DevOp

September 2016 - January 2018
1 year 5 months
full-time
Project
Bonn
🎯

Overview

As Software Craftsman & DevOp at SISTRIX, I built Big Data pipelines for SEO analysis with "You Build It, You Run It" methodology. CI/CD pipeline with Jenkins/Docker, data extraction with Spark/Hadoop for 450M+ keywords worldwide, and PaaS architecture with Apache Mesos/Marathon.

HTML parser with XPath for effective structuring of millions of keywords per country (Germany: ~100M, USA: ~52M), HTML crawler for 200M seeds, and automated acceptance tests for SaaS tools. Status dashboard with Play Framework for project transparency and operational overview.

Activities

  • Big Data Pipeline Development: HTML parser with XPath for millions of keywords per country, API integration
  • Data Extraction: HTML crawler with Spark/Hadoop for 200M seeds, structured data extraction
  • DevOps & CI/CD: "You Build It, You Run It" pipeline with Jenkins/Docker, automated deployments
  • PaaS Architecture: Apache Mesos/Marathon, AWS Route 53, scalable infrastructure
  • Quality Assurance: Automated acceptance tests for SaaS tools, Cucumber testing
  • Monitoring & Operations: Status dashboard with Play Framework, operational transparency
🔄

Methodology

  • "You Build It, You Run It": DevOps Culture, End-to-End Ownership
  • Big Data Processing: Spark/Hadoop, Scalable Data Pipelines
  • CI/CD: Automated Testing, Continuous Deployment
  • PaaS Architecture: Container Orchestration, Service Mesh

Technology Stack

Technologies and tools used in this project

⚙️

Backend

4
Java logo
Java
Scala logo
Scala
Play Framework logo
Play Framework
ArangoDB logo
ArangoDB
📦

Other

6
Akka
Data Storage
XPath
Cucumber
Testing
Quality Assurance
📊

Data & AI

5
Spark logo
Spark
Hadoop logo
Hadoop
Big Data Processing logo
Big Data Processing
Container Orchestration logo
Container Orchestration
MySQL logo
MySQL
🚀

DevOps

5
Docker logo
Docker
Jenkins logo
Jenkins
CI/CD Pipeline
Apache Mesos logo
Apache Mesos
Marathon logo
Marathon