Who are we looking for
- We are looking for a Research Engineer where you will be working on research and development of cutting edge synthetic data generation techniques in our corporate lab with a university.
- Work closely with a world-class engineering team alongside software and AI/ML engineers to develop and incorporate algorithms into our AI synthetic data generation platform.
- This position is non-remote in Singapore.
What you will be doing
Responsibilities
- Assist in conducting research and development of state-of-the-art generative models for tabular synthetic data including using cutting edge technologies such as Generative Adversarial Networks (GANs), language models (transformers, LLMs), diffusion models, etc.
- Support and maintenance of research and web application software including performing experiments, building and deploying ML models, building microservices, building ml pipelines, developing user interfaces or visualisations.
- Come up with new ideas or improve upon existing ideas in synthetic data generation including using generative models, language models, privacy preserving technologies or ethical/fair AI.
- Develop and improve metrics to measure the quality of synthetic data including its utility and privacy.
- Work closely with engineers in the end-to-end development of incorporating the algorithms into our synthetic data platform.
- Opportunity to assist in publishing high quality research papers and publications in top conferences and journals.
- Present, disseminate and explain our work at internal and external events.
Qualifications
Requirements
- Undergraduate degree and above in Computer Science, Machine Learning, Statistics, Mathematics or a related field (e.g. Electrical Engineering).
- Fluency in Python which we predominately work with. Basic use of other programming languages (Javascript, Typescript, C++) which we occasionally use would be good to have.
- An understanding of the importance of good practices for producing reliable software and reproducible analyses (e.g. version control, issue tracking, automated testing, package management).
Good to have
- Experience in one or more of the following areas:
- Generative models (e.g. GANs, transformers, language models) - Strong plus
- Deep learning / Neural networks
- Privacy preserving technologies (differential privacy, trusted execution environments, blockchain)
- Fair / Ethical AI
- Data Science