Senior Embedded Engineer - Warszawa


Warszawa, Mazowieckie, Poland

Your new company
For our client, a global technology and analytics services company, we are currently looking for a Senior Embedded Engineer.

Your new role

Develop and optimise AI inference models for deployment on edge devices with embedded GPU/TPU accelerators, focusing on local LLM inference.
Influence the Edge AI strategy by providing expert advice on design and architecture.
Implement and fine-tune low-latency model inference pipelines to meet real-time performance requirements.
Collaborate with the GPU Hardware Design Team to design and optimise GPUs that power next-generation devices.
Conduct performance profiling and optimisation to maximise the efficiency of GPU/TPU acceleration for local LLM inference.
Work on micro-architecture development, ensuring efficient execution of graphics, compute, and AI workloads within energy and area constraints.
Collaborate with cross-functional teams to integrate AI inference solutions into edge computing platforms and applications.
Make critical decisions regarding technical directions, scalability, and system performance.
Provide technical expertise and support to project teams, ensuring successful implementation and deployment of edge AI solutions.

What you'll need to succeed

We’re seeking a Senior Embedded Engineer with expertise in Edge AI to join our client's team. As a key contributor, you’ll shape the future of Edge AI solutions. You will combine technical excellence with effective leadership to drive projects forward, drawing on hands-on experience with Large Language Model inference on embedded GPU/TPU architectures.

5+ years of experience in AI model development and deployment, with a focus on edge computing.
Competence in LLM frameworks (e.g., vLLM, Text Generation Inference, OpenLLM, Ray Serve, and Hugging Face Transformers) and deep learning libraries.
Experience with GPU/TPU acceleration for AI inference, including optimisation techniques (tensor, pipeline, data, and sharded data parallelism) and performance tuning.
Programming skills in languages such as Python and C++.
Familiarity with one or more GPU frameworks (CUDA, Vulkan, OpenCL) and with NVIDIA Jetson, ARM Mali, or other relevant SoC configurations.
Knowledge of parallel computation, memory scheduling, and structural optimisation.

What's in it for you?

Remote work
Medical subscription
Free unlimited access to Udemy, plus 5 days off per year to take courses
Paid study holiday for bachelor students
Referral bonus
Long-term contribution rewards
Lunch & Learn sessions
Company sports competitions, hackathons, reading marathons
Social community events
Team building parties, team events

What you need to do now
If you're interested in this role, click 'apply now' to forward an up-to-date copy of your CV, or call us now.

Hays Poland sp. z o.o. is an employment agency registered in the registry kept by the Marshal of the Mazowieckie Voivodeship under number 361.

Information:

Company:	HAYS POLAND Sp. z o.o.
Location:	Warszawa, Mazowieckie, Poland
Posted:	19.7.2024 (position currently open)
