About the Job
We are looking for a Data Engineer (f/m/d) with experience on web crawling and PDF parsing to join our Certivity team.
Your mission
Certivity is a tech startup that digitizes the way how product makers manage regulatory requirements to be compliant. In this role you will experiment with text generation, summarisation, search and question answering using the newest technologies. You will also transform successful experiments into product features loved by our customers. You will have a chance to apply cutting edge NLP techniques to real-world problems in the regulatory domain. As a part of our small (but growing) team you will have a unique opportunity to shape the product and the company.
Your profile
As a Data Engineer (m/f/d) at Certivity, your responsibilities within the data team will be to extract data from websites and PDFs using Python, build data pipelines in the cloud with Azure, and store and manage data in MongoDB.
Your profile:
- You are a Python expert (C++ is a plus);
- You have experience in web scraping and crawling;
- You have demonstrable experience with PDF parsing;
- You have hands-on experience with cloud applications (Azure is preferred);
- You are familiar with NoSQL databases (MongoDB is preferred);
- You are in Munich, of based in Germany;
- Fluent in spoken and written English (company language is English)
About us
Founded in 2021, we at Certivity are a young, creative, and dynamic team of highly skilled domain experts working on innovative RegTech Software. Our solution will shape and define the term “RegTech” as we develop a completely new product for the highly complex and not yet digitized environment of product regulations. Our primary focus is the automotive industry, with options to expand to many other domains as well. Our B2B cloud platform bundles all relevant tasks for compliance with vehicle regulations and laws. For more information about us and our product have a look at our website.