Who We Are Looking For:

We are seeking a proactive Large Language Model (LLM) Engineer to lead the development of text generation systems. The ideal candidate will have a strong command of deep learning and advanced NLP techniques.

Key Responsibilities:

Architect and fine-tune large language models to generate synthetic text grounded in the statistics of a tabular dataset.
Collaborate with cross-functional teams, including AI researchers and engineers, to integrate LLM solutions into existing synthetic data generation pipelines.
Stay updated on the latest advancements in LLMs, ensuring our solutions remain at the forefront of technology.
Implement prompt engineering strategies to enhance the quality and relevance of generated text.
Be familiar with fine-tuning open-weight models such as Qwen2, Llama 3, and Mistral, including instruction fine-tuning and DPO.
Be familiar with training frameworks such as TRL, Unsloth, and the Hugging Face ecosystem.
Evaluate and optimize model performance.
Develop and maintain comprehensive documentation for model architectures, training recipes, and deployment procedures.

Essential Skills and Qualifications:

High Priority:

Bachelor's or Master's degree in Computer Science, Data Science, or a related field.
Proven experience in developing and deploying large language models, with a focus on text generation.
Strong understanding of LLM and deep learning techniques, such as prompt engineering, fine-tuning, RAG, and LoRA.
Strong proficiency in Python and in deep learning frameworks such as PyTorch and TensorFlow.
Hands-on experience applying prompt engineering techniques to improve LLM outputs.
Excellent problem-solving skills and the ability to work collaboratively in a fast-paced startup environment.
Strong communication skills, with the ability to articulate complex technical concepts to non-technical stakeholders.