<h4><strong>About Modulai</strong></h4><p>Modulai’s clients range from startups to multinational companies. What they all share is that machine learning is central to how they operate, compete, and create value. Our services range from advisory projects and feasibility studies to end-to-end development and refinement of machine learning systems and products. We use state-of-the-art techniques, always focusing on maximizing business impact, delivering solutions in areas such as credit risk, fraud detection, dynamic pricing, recommendation systems, computer vision, natural language processing, opportunity spotting, logistics optimization, up-sell, cross-sell, smart building optimization, predictive maintenance, and route planning.</p><h3><strong>Facts</strong></h3><p>When doing a master's thesis project at Modulai, you are invited to all team activities, such as daily stand-ups, weekly learning breakfasts, and monthly AWs. We look forward to having you as part of our team and expect you to work as much as possible in the office.</p><p>One of the projects will be based in <strong>Gothenburg</strong>, and one in <strong>Stockholm</strong>.</p><p>We have a <strong>strong history of master's thesis students joining Modulai for their first job</strong> in machine learning engineering. We are excited to explore this opportunity with you!</p><h4><strong>1. Unified mixed-modal transformers for efficient understanding and generation (STOCKHOLM)</strong></h4><p dir="ltr"><strong>Background & Description</strong></p><p dir="ltr">Modulai is offering a master's thesis opportunity focused on developing cutting-edge models capable of processing and generating across multiple data modalities (text, images, video, and audio) within a unified framework. Current state-of-the-art multimodal models often treat tasks like visual understanding and text generation separately, but recent advances in unified transformers demonstrate the potential to handle these tasks efficiently within a single architecture.</p><p dir="ltr">The project will involve designing and experimenting with mixed-modal models that combine autoregressive methods for text generation with diffusion-based techniques for continuous data (such as images and video). You will explore how to fuse different types of data representations—discrete tokens for text and continuous/discrete vectors for visual data—into a unified model capable of performing tasks like text-to-image generation, visual question answering, and more.</p><p dir="ltr">The goal is to develop and investigate a scalable, unified mixed-modal model for a set of domains. The model should be capable of efficiently handling multiple data modalities within a single architecture. You will compare the mixed-modal model’s performance against other state-of-the-art multimodal models and/or traditional modality-specific architectures. The comparison will focus on key factors such as overall performance, computational efficiency, and potential for fine-tuning across specific domains.</p>
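<p dir="ltr">To make the setup more concrete, the sketch below shows, in PyTorch, one way a single transformer backbone can consume a mixed sequence of discrete text tokens and continuous image latents, with an autoregressive head for the text positions and a denoising (diffusion-style) head for the visual positions. It is only an illustration of the general idea, not a specification for the project: the dimensions, the random data, and the simplified noising and masking steps are assumptions made for this example.</p><pre><code># Minimal sketch of a unified mixed-modal backbone (illustrative only).
# Assumptions: toy dimensions, random data, and a simplified objective of
# next-token prediction on text plus noise prediction on image latents.
import torch
import torch.nn as nn
import torch.nn.functional as F

class MixedModalBackbone(nn.Module):
    def __init__(self, vocab_size=1000, d_model=256, img_dim=64, n_layers=4, n_heads=4):
        super().__init__()
        self.text_emb = nn.Embedding(vocab_size, d_model)        # discrete text tokens
        self.img_proj = nn.Linear(img_dim, d_model)              # continuous image latents
        layer = nn.TransformerEncoderLayer(d_model, n_heads, 4 * d_model, batch_first=True)
        self.backbone = nn.TransformerEncoder(layer, n_layers)   # one shared backbone
        self.lm_head = nn.Linear(d_model, vocab_size)            # autoregressive text head
        self.denoise_head = nn.Linear(d_model, img_dim)          # predicts noise on image latents

    def forward(self, text_tokens, noisy_img_latents):
        # Both modalities are embedded into the same space and concatenated
        # into a single sequence processed by the shared transformer.
        x = torch.cat([self.text_emb(text_tokens), self.img_proj(noisy_img_latents)], dim=1)
        # Causal mask keeps text generation autoregressive
        # (simplified: applied to the whole mixed sequence).
        seq_len = x.size(1)
        mask = torch.triu(torch.full((seq_len, seq_len), float("-inf")), diagonal=1)
        h = self.backbone(x, mask=mask)
        n_text = text_tokens.size(1)
        return self.lm_head(h[:, :n_text]), self.denoise_head(h[:, n_text:])

# Toy training step on random data.
model = MixedModalBackbone()
text = torch.randint(0, 1000, (2, 16))        # (batch, text_len)
clean_latents = torch.randn(2, 8, 64)         # (batch, n_patches, img_dim)
noise = torch.randn_like(clean_latents)
noisy_latents = clean_latents + noise         # crude stand-in for a diffusion noising step

logits, noise_pred = model(text, noisy_latents)
lm_loss = F.cross_entropy(logits[:, :-1].reshape(-1, 1000), text[:, 1:].reshape(-1))
diffusion_loss = F.mse_loss(noise_pred, noise)
loss = lm_loss + diffusion_loss               # single model, two modality-specific losses
loss.backward()
</code></pre><p dir="ltr">In a real system the image latents would come from a learned tokenizer or VAE encoder, noise levels would be sampled per example, and attention within the image block would typically be bidirectional; the papers referenced below discuss these design choices in detail.</p>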
<p dir="ltr"><strong>ML techniques and tools</strong></p><ul><li>Transformer-based architectures</li><li>Diffusion models</li><li>Multimodality</li><li>Fine-tuning strategies</li><li>Python, PyTorch, Git</li></ul><p dir="ltr"><strong>References</strong>:</p><p dir="ltr"><a href="https://arxiv.org/pdf/2405.09818">https://arxiv.org/pdf/2405.09818</a></p><p dir="ltr"><a href="https://arxiv.org/pdf/2408.12528">https://arxiv.org/pdf/2408.12528</a></p><p dir="ltr"><a href="https://arxiv.org/pdf/2408.11039">https://arxiv.org/pdf/2408.11039</a></p><h4><strong>2. Large to small language model distillation thesis project (GOTHENBURG)</strong></h4><p><strong>Background & Description</strong></p><p dir="ltr">Modulai is offering a master’s thesis opportunity focused on knowledge distillation of large language models. Knowledge distillation, a concept popularised by Hinton et al. in 2015, involves transferring knowledge from a larger, complex "teacher" model to a smaller, more efficient "student" model. The student model learns to replicate the behaviour of the teacher model by minimising the differences in their outputs. A key advantage of knowledge distillation, as opposed to training the student model from scratch, is that the teacher model provides more informative soft labels (distributions across the vocabulary at each prediction step). These soft labels offer a stronger learning signal than the hard, one-hot labels available in regular pre-training or fine-tuning.</p><p dir="ltr">In recent years, there has been a surge of promising research in this field, particularly focused on applying these techniques to LLMs. Recently released examples created using knowledge distillation include Gemini Flash, GPT-4o mini, and Llama 3.2 1B and 3B. Common methods include combining knowledge distillation with weight pruning, or using reinforcement learning and imitation learning to help guide the training process. However, the details of how the large labs create these models are generally not disclosed.</p><p dir="ltr">The goal of this project is to research new distillation methods and develop a compact and efficient language model based on open-weight student and teacher models. In the process, we will shed some light on how state-of-the-art knowledge distillation is done in practice, adding to the community's knowledge. You will familiarise yourself with the latest advances in knowledge distillation, implement techniques from research papers, and experiment with different approaches. You will compare the performance of different distillation approaches, as well as baseline models, in terms of both model quality and computational efficiency.</p>
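<p dir="ltr">As an illustration of the core idea (not part of the project specification), the sketch below implements a Hinton-style distillation loss in PyTorch: the student is trained against the teacher's temperature-softened output distribution in addition to the usual hard labels. The vocabulary size, temperature, and blending weight are arbitrary values chosen for the example; real LLM distillation pipelines, such as the Minitron approach referenced below, combine this kind of loss with pruning and other techniques.</p><pre><code># Minimal sketch of soft-label knowledge distillation (illustrative only).
# Assumptions: student and teacher share a vocabulary, and the toy values
# for temperature and alpha are chosen just for this example.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, hard_labels,
                      temperature=2.0, alpha=0.5):
    # Soft targets: the teacher's full distribution over the vocabulary,
    # softened by the temperature so small probabilities still carry signal.
    soft_targets = F.softmax(teacher_logits / temperature, dim=-1)
    log_student = F.log_softmax(student_logits / temperature, dim=-1)
    # KL divergence between teacher and student distributions,
    # scaled by T^2 as in Hinton et al. (2015).
    kd = F.kl_div(log_student, soft_targets, reduction="batchmean") * temperature ** 2
    # Standard cross-entropy on the hard, one-hot next-token labels.
    ce = F.cross_entropy(student_logits, hard_labels)
    return alpha * kd + (1.0 - alpha) * ce

# Toy usage with random logits over a vocabulary of 100 tokens.
student_logits = torch.randn(8, 100, requires_grad=True)   # from the (trainable) student
teacher_logits = torch.randn(8, 100)                        # from the (frozen) teacher
labels = torch.randint(0, 100, (8,))
loss = distillation_loss(student_logits, teacher_logits, labels)
loss.backward()
</code></pre>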
<p><strong>ML techniques and tools</strong></p><ul><li>Transformer-based architectures</li><li>Knowledge distillation strategies</li><li>Local LLMs</li><li>Python, PyTorch, Git</li></ul><p><strong>References</strong></p><p dir="ltr">LLM Pruning and Distillation in Practice: The Minitron Approach:<br><a href="https://arxiv.org/pdf/2408.11796">https://arxiv.org/pdf/2408.11796</a></p><p dir="ltr">Compact Language Models via Pruning and Knowledge Distillation:<br><a href="https://www.arxiv.org/pdf/2407.14679">https://www.arxiv.org/pdf/2407.14679</a></p><p dir="ltr">On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes:<br><a href="https://arxiv.org/pdf/2306.13649">https://arxiv.org/pdf/2306.13649</a></p><p dir="ltr">Gemma 2: Improving Open Language Models at a Practical Size:<br><a href="https://arxiv.org/pdf/2408.00118">https://arxiv.org/pdf/2408.00118</a></p><p dir="ltr">Distilling the Knowledge in a Neural Network:<br><a href="https://arxiv.org/pdf/1503.02531">https://arxiv.org/pdf/1503.02531</a></p><h4><strong>3. Open Application within Applied Machine Learning</strong></h4><p>Applied machine learning projects encompass a wide range of domains, including healthcare, finance, natural language processing, computer vision, and more. This open application invites students to propose projects aligned with their interests and career goals. Do you have an idea? Let us know what it is about by describing it in your application.</p><h3><strong>Required Skills</strong></h3><p>You should be finishing a master's degree in machine learning, or a master's in another field supplemented with courses in machine learning and programming.</p><h4><strong>Please include the following in your application:</strong></h4><ul><li>A link to a relevant GitHub account, if available.</li><li>Grades for your bachelor's and master's degrees.</li><li>An updated CV or an updated LinkedIn profile.</li><li>Preferred location: Stockholm or Gothenburg.</li></ul><p><em>*Suitable candidates will be invited to an interview before a final decision is made.<br>The last date for application is the <strong>25th of October</strong>, but the process may close earlier if suitable candidates apply.</em></p>