Data Scientist
Summary
Endre Palatinus is a Co-Founder and Data Scientist at d:AI:mond GmbH. He enjoys solving data science problems in R and Apache Spark, in particular, scientific data analysis and optimizing the performance of big data systems.
Education
PhD in Computer Science (Big Data Analytics), Saarland University, 2011 – 2016
MSc in Computer Science (Software Engineering), University of Szeged, 2009 – 2011
BSc in Computer Science, University of Szeged, 2006 – 2009
Experience
Co-founder & Data Scientist, d:AI:mond GmbH, 2008-present:
Served as co-CEO. Responsible for B2B sales and finances. Worked as a data scientist.
Project: Data Integration in the Chemical Industry
I have completed a large-scale data integration and data mining project for a German chemical giant. I have written software for extracting ingredient lists and measurement results from wildly different Excel Vles with no Vxed schema. This involved using data mining and knowledge extraction techniques, as well as setting up ETL pipelines. I have closed the deal and was responsible for all parts of the project, including requirements specification, budgeting, and implementation.
Project: Quality Assurance and Performance Tuning of a Time-Series Library
I have completed a project for a German premium car manufacturer, where I have tested and tuned their big data system for storing and analyzing sensor data of prototype vehicles. This involved writing unit tests in Scala for a distributed system based on Apache Spark. In the performance tuning phase of the project, I have applied my research expertise in the big data analytics area and achieved a factor 5 performance boost. The system is used for crunching petabyte-scale datasets, and thus it required large-scale automatic VM orchestration and system setup. This project involved diving into the CAN bus architecture for sensor fusion, and system validating using signal temporal logic (STL) as well.
Project: Building a recommendation engine for wines
Implemented a hybrid item- and content-based recommendation engine using collaborative-filtering for wines. Deployed into production on Amazon’s private recommendation engine.
Project: Dynamic Pricing in the Hospitality Industry
Implemented data-pipelines from various booking, pricing and payment providers. Created dashboards for data-driven decision making. Implemented a model for dynamic pricing.
Skills
- R
- Shiny
- Spark
- Scala
- SQL
- Java
- C/C++
- Docker
Specialties
- Database Technology, Research
Spoken Languages
- English (C1), German (fluent), Hungarian (native)