"It is a capital mistake to theorize before one has data. Insensibly one begins to twist facts to suit theories, instead of theories to suit facts"

-- Sherlock Holmes (Arthur Conan Doyle, 1892)

Profession

I'm a self-employed Data Scientist (Type B). My professional focus is currently on advanced analytical problems, which can be solved by methods of Machine Learning, especially in the fields of language and text.

Methods:

  • Data Analytics
  • Statistical Modeling
  • Data Visualization
  • Machine Learning, including Deep Learning
  • TrustworthyAI
  • Natural Language Processing

Toolbox

Data Science:

  • PyData Stack (Anaconda, Jupyter, Pandas, Numpy, scikit-learn, ...)
  • spaCy, NLTK
  • Elasticsearch, MongoDB, PostgreSQL/ PostGIS, SQLite
  • Keras, TensorFlow, Transformers, MLflow
  • QGIS, Folium
  • Apache Spark

Software Development:

  • Git
  • Python, PyCharm
  • Scala, Akka, SBT, IntelliJ
  • Flask, Pelican, Node.js, JS, HTML, Bootstrap (all of them, only if necessary)
  • Linux, Bash
  • AWS, GCP, IBM Cloud

Certificates and Achievements

Showcase of certificates, I collected during the last years (there are some more, >40):

Recent achievements: