Passende Schulungen
finden Sie hier:
↪ Microsoft Azure
↪ Microsoft Certified: Azure Enterprise Data Analyst Associate
↪ Data Mining - Machine Learning
↪ AWS Certified Machine Learning - Specialty (ACMLS)
↪ Digitalisierung
↪ Microsoft Dynamics 365 Finance & Supply Chain Management
↪ Microsoft System Center 2016
↪ Microsoft Exchange Server 2013
↪ SQL Server 2016
↪ SQL Server 2017
↪ Python
↪ NoSQL
↪ MySQL
↪ SQL Server
↪ Azure Cosmos DB
↪ Azure HDInsight
↪ Docker
↪ Agilität
↪ Git
↪ Microsoft Certified: Azure AI Engineer Associate
↪ Python Zertifizierung
↪ FileMaker
↪ WatchGuard
↪ Künstliche Intelligenz

Architect (H/U/M/A/N)* Data Science Python PySpark, Scikit-Learn, XGBoost, MLflow, Matplotlib #9807


1 keine Angabe keine Angabe Wochen
Registrieren Sie sich jetzt kostenlos
um Ihre Anfrage versenden zu
können!

Oder loggen Sie sich ein!
Anfrage senden

Dieses Projekt wurde bereits erfolgreich vermittelt!
Projektdetails
Projektbeginn: 06.07.2023
Abrechnung nach: nach Vereinbarung
Nebenkosten abrechenbar: Nein
Projektvolumen: keine Angabe

Projektbeschreibung
  • Good communicator, but great at independent work. Solution oriented. Analytical thinking
  • Experience in Data Science, machine learning, statistical modelling, multivariate analysis, exploratory data analysis, software development and data engineering
  • Optimally, candidate should have a degree in mathematics, physics, computer sciences or in a related field
  • Several years of experience in Python programming are required, especially in working with PySpark, Scikit-Learn, XGBoost, MLflow, Matplotlib and related libraries
  • Good familarity with Azure, Git, GitLab, CI/CD, Docker and Databricks are beneficial.
  • Candidate should be familiar or inclined to working in an agile environment.
  • Prior experience with predictive maintenance tasks is a plus.

Ihre Aufgaben

Our customer has five different machine learning-based solutions that identify weak points in the medium voltage grid. While targeting the same purpose, they differ in their data source systems, ML features, ML algorithms and especially the Distribution System Operator (DSO) for which they have been developed. Currently, none of these five solutions is easily applicable to a different target DSO or implemented in a scalable way. A newly developed data platform (iPEN) that collects, prepares, and provides data from all DSOs now allows to develop a new solution that consumes all needed data from a single source system. Based on unified data preparation, feature extraction, and machine learning steps, this new solution should be applicable to all customers DSOs. Particular attention will be paid to the scalability, stability, and maintainability of the resulting software. Data Science tasks related to projects, e.g. Predictive Maintenance Solutions.
  • Advice and design business-critical data science use cases, from the business problem to delivery and operation.
  • Statistical analysis and exploration of static, mixed, and time-series data.
  • Writing of exploration and production code in Python and PySpark on a Databricks Tech-Stack.
  • Design and implementation of ML algorithms related to failure predictions.
  • Improvement and optimization of existing ML algorithms using various tuning techniques.
  • Study of new features, feature importances, correlations, causations, data leakage, up- and downsampling.
  • Advice data engineers in writing production code for feature engineering.

Kontakt

Cegeka Deutschland GmbH
Manuela Fentrohs
Putzbrunner Str. 71
81739 München
Tel. +49 89 74 833 873
Fax +49 89 74 833 920
manuela.fentrohs@cegeka.com

Sprachkenntnisse
Deutsch
Experte
Kenntnisse & Fähigkeiten
Agilität
Grundkentnisse
COM/DCOM
Grundkentnisse
Docker
Grundkentnisse
git
Grundkentnisse
Machine Learning
Grundkentnisse
Microsoft Azure
Grundkentnisse
Python
Grundkentnisse
SQL Azure
Grundkentnisse