Match score not available

Research Engineer - Vision (VLM, Vision LLM)

Remote:

Full Remote

Work from:

European Union

Offer summary

Qualifications:

PhD in Computer Science or equivalent experience, Deep theoretical understanding of Computer Vision, Experience with visual language models, Ability to lead independent research initiatives.

Key responsabilities:

Develop novel architectures for computer vision
Advance theoretical foundations for AI Agents and action models

techire ai http://www.techire.ai

2 - 10 Employees

See all jobs

Job description

Are you passionate about solving fundamental challenges in Vision and multimodal AI understanding?

You should consider joining a pioneering European AI Agent company as a Research Engineer to help advance the theoretical foundations of how AI systems process and understand complex interfaces.Their mission is to break new ground in AI's ability to comprehend multimodal data from real-world interfaces. The next stage of their journey is to solve core scientific challenges in visual understanding and its relationship with underlying document structures

.
The company has established strong research foundations in multimodal AI, and now seeks to achieve scientific breakthroughs in visual-linguistic understanding and structured data interpretatio

n.
Key Research Are

as:Develop novel architectures for computer vision and structured multimodal understand
ingAdvance the theoretical foundations of representation spaces for AI Agents and action mod
elsPioneer new multimodal approaches for the best resu
ltsPush the boundaries of structure understand

ingRequiremen

ts:PhD in Computer Science, Machine Learning, or related field or equal practical experience in industry plus a Maste
rs.Deep theoretical understanding of Computer Vision and multimodal learning architectu
resExperience with visual language models (e.g., LLaVA or simil
ar)Ability to lead independent research initiati
vesExperience in the whole research pipeline from research, experimentation, building & training models, improving performance e

tc.Nice to ha

ve:Experience working on structured understanding, document understanding or simi
larStrong publications record in top-tier conferences (NeurIPS, ICML, ICLR, CV

PR)This role offers the opportunity to conduct groundbreaking research in multimodal AI understanding. You'll be investigating fundamental questions at the intersection of vision, language, and structured data interpretation, with the goal of achieving significant scientific breakthroug

hs.
The company values scientific innovation and provides an environment conducive to pursuing novel research directions. They offer competitive compensation of Tier 1 AI lab salaries, plus attractive equity opti

ons.
The position is remote in Europe, with some regular travel to meet with the team in P

aris.
If you're excited about conducting pioneering research in computer vision and multimodal AI and want to contribute to the theoretical foundations of next-generation AI systems, this could be your ideal opportunity to make a lasting i

mpact.
Interested in being part of this AI revolution? Apply today. All applicants will receive a re

sponse.