Have Copilot read job descriptions and extract out key info you want to know. Click "Analyze All" to try it out. Click on the Copilot's gear icon to customize the prompt.

I

Member of Technical Staff, Research Engineer (Inference)

inflectionai · 30+ days ago

Palo Alto, CA

$175-325k

Full-time

Continue

By pressing the button above, you agree to our Terms and Privacy Policy, and agree to receive email job alerts. You can unsubscribe anytime.

What We're Building

As Inflection embarks on a new stage of growth, we are focusing on collaborating with commercial partners to adapt and fine-tune our cutting-edge models for their unique business requirements. Our accomplishments in developing, aligning, and deploying state-of-the-art models in our high EQ consumer-facing chatbot, Pi, have established a strong foundation for success. Well-funded and equipped with ample H100 resources, we have built a robust infrastructure and efficient processes to support best-in-class finetuning. By joining our team, you'll have the opportunity to contribute your expertise while being part of a dynamic organization that values innovation and collaboration.

About Inflection

Inflection is a small, interdisciplinary AI studio. We have trained several state-of-the-art language models, including Inflection 1 and Inflection 2.5, and built a personal assistant named Pi. As a studio, we are currently focused on finetuning and deploying models for specific use cases for our commercial partners.

We believe that artificial intelligence represents the beginning of an era of exponential change. Our name Inflection embraces this moment of transformation, whilst our status as a public benefit corporation provides us with the legal mandate to prioritize the well-being and happiness of our partners, users, and wider stakeholders above all else.

About the Role

Member of Technical Staff, Research Engineer (Inference)

As part of Inflection’s commitment to deploying high-performance models for enterprise applications, our inference team ensures that these models run efficiently and effectively in real-world scenarios. Research engineers in this role focus on optimizing model inference processes, reducing latency, and improving throughput without compromising model performance, ensuring robust deployment in enterprise environments.

This is a good role for you if you:

Have experience with deploying and optimizing LLMs for inference, both in cloud and on-prem environments.
Are adept at using tools and frameworks for model optimization and acceleration, such as ONNX, TensorRT, or TVM.
Enjoy troubleshooting and solving complex problems related to model performance and scaling.
Have a deep understanding of the trade-offs involved in model inference, including hardware constraints and real-time processing requirements.
Are proficient with PyTorch and familiar with infrastructure management tools like Docker and Kubernetes for deploying inference pipelines.

Employee Pay Disclosures

At Inflection AI, we aim to attract and retain the best employees and compensate them in a way that appropriately and fairly values their individual contributions to the company. For this role, Inflection AI estimates a starting annual base salary will fall in the range of approximately $175,000 - $325,000 depending on experience. This estimate can vary based on the factors described above, so the actual starting annual base salary may be above or below this range.

How We Work

We value excellence and ownership. Our organizational structure focuses on individual responsibilities rather than management hierarchies. Everyone is expected to lead by doing. We are big believers in the unreasonable effectiveness of highly talented Individual Contributors who are given all the resources, space and ownership to move fast and deliver outstanding results.

Teamwork and generosity are at our core. Our culture celebrates positive challenges, asking questions, learning and actively supporting one another. This mentality of shared respect and purposeful teamwork is key to our success. We equally value all technical and non-technical contributions.

Constructive disagreement is essential. We appreciate when team members challenge assumptions, put forward new ideas, or encourage us to move faster or slower. Openness, honesty and kindness make us great.

Feedback is our ground truth. We have a tight feedback loop between the user experience and our AI creation process. Quantitative and qualitative data drives our priorities. This goes for internal culture too. Everyone has ownership and visibility into key decisions and progress.

Writing creates accountability. Whether on internal communication tools or in team memos, we are strong communicators with a special focus on the written word.

We deeply value time to reset outside of work. We encourage one another to constantly take time to recharge and always focus on maintaining a healthy work-life balance.

Engineering at Inflection

We are a vertically integrated AI studio. This means that our entire technology stack – from large foundational model pre-training to the user interface – is built in-house, with each of the components co-optimized to deliver the best AI experiences. We have built one of the most advanced large language models in the world, based on multiple novel and proprietary innovations.

We believe in scale as the engine of progress in AI, and we are building one of the largest supercomputers in the world to develop and deploy the new generation of AIs.

We wear multiple hats and don’t distinguish between engineering and research. We continuously explore and exploit, creating new and perfecting existing techniques and solutions. User feedback is our North Star.

Our Benefits

We offer generous benefits to ensure a positive, safe, inclusive and inspiring work environment for all Inflectioneers.

Unlimited paid time off
Parental leave and flexibility for all parents and caregivers
Generous medical, dental and vision plans for US employees
Compliance with country-specific benefits for non-US employees
Visa sponsorship for new hires
Avenues for personal growth such as coaching, conference attendance, or specific trainings

Diversity & Inclusion

We are building personal AIs that we hope will serve everyone. We are deeply committed to representing the full extent of the human experience inside our AI Studio. This means that everyone from any walk of life is welcome if you have the right skills. We populate diverse candidate pools for all open roles.

•

Last updated on Aug 19, 2024

About the company

I

inflectionai

More jobs at inflectionai

Analyzing

Member of Technical Staff, Platform Software Engineer$150-250k

Palo Alto, California

·

30+ days ago

Member of Technical Staff, Research Engineer (Finetuning)

Palo Alto, California

·

30+ days ago

Member of Technical Staff, Machine Learning Software Engineer

Palo Alto, California

·

30+ days ago

Member of Technical Staff, Research Engineer (Inference)

Palo Alto, California

·

30+ days ago

Member of Technical Staff, Research Engineer (Pretraining)

Palo Alto, California

·

30+ days ago

For job seekers

Job searchSearch millions of jobs

LaunchpadNew

Resume to business

Cover Letter StudioGenerate a cover letter

Add to ChatGPTFind and discuss jobs

For employers and recruiters

Resume ScreenerOrganize and rank candidates

Talent PipelinePre-order

Source top talent

Promote a jobReach more candidates

For everyone

Referral programEarn 30% commission

Get mobile appBrowse anywhere

Jobs API

Become a partner

Developed by Blake and Linh in the US and Vietnam.

We're interested in hearing what you like and don't like! Live chat with our founder or join our Discord

Changelog

🚀 LaunchpadNov 27

Create a site and sell services based on your resume.

🔥 Job search dashboardNov 13

Revamped job search UI with a sortable grid, live filtering, bookmarks, and application tracking.

🫡 Cover letter instructionsSep 27

New Studio settings give you control over AI output.

✨ Cover Letter StudioAug 9

Automatically generate cover letters for any job.

🎯 Suggested filtersAug 6

Copilot suggests additional filters above the results.

⚡️ Quick applicationsAug 2

Apply to jobs using info from your resume. Initial coverage of ~200k jobs in Spain, Germany, Austria, Switzerland, France, and the Netherlands.

🧠 Job AnalysisJul 12

Have Copilot read job descriptions and extract out key info you want to know. Click "Analyze All" to try it out. Click on the Copilot's gear icon to customize the prompt.