Data Engineer

Sancare, Paris, FR
June 2023 - Present
- Design and implement robust ETL pipelines for healthcare data with Python and SQL, ensuring data integrity and system scalability.
- Data recovery from complex hospital databases.
- Indexing and extraction of a wide range of textual data (PDF, DOC, HTML, CSV, HL7 FHIR, ...).
- Implement Optical Character Recognition (OCR) technology to extract and digitize data.
- Optimising queries using the SQL execution plan.
- Investigation, repair and improvement of legacy code.
- Contribute to clinical studies by providing data engineering support, ensuring accurate and timely data processing for research purposes.
- Collaborate with clients to define product specifications, translating business needs into technical requirements.