Match score not available

Research Engineer - Vision (VLM, Vision LLM)

Remote: 
Full Remote
Work from: 

Offer summary

Qualifications:

PhD in Computer Science or equivalent experience, Deep theoretical understanding of Computer Vision, Experience with visual language models, Ability to lead independent research initiatives.

Key responsabilities:

  • Develop novel architectures for computer vision
  • Advance theoretical foundations for AI Agents and action models
techire ai logo
techire ai http://www.techire.ai
2 - 10 Employees
See all jobs

Job description

Are you passionate about solving fundamental challenges in Vision and multimodal AI understanding?



You should consider joining a pioneering European AI Agent company as a Research Engineer to help advance the theoretical foundations of how AI systems process and understand complex interfaces.Their mission is to break new ground in AI's ability to comprehend multimodal data from real-world interfaces. The next stage of their journey is to solve core scientific challenges in visual understanding and its relationship with underlying document structures


.
The company has established strong research foundations in multimodal AI, and now seeks to achieve scientific breakthroughs in visual-linguistic understanding and structured data interpretatio


n.
Key Research Are

  • as:Develop novel architectures for computer vision and structured multimodal understand
  • ingAdvance the theoretical foundations of representation spaces for AI Agents and action mod
  • elsPioneer new multimodal approaches for the best resu
  • ltsPush the boundaries of structure understand

ingRequiremen

  • ts:PhD in Computer Science, Machine Learning, or related field or equal practical experience in industry plus a Maste
  • rs.Deep theoretical understanding of Computer Vision and multimodal learning architectu
  • resExperience with visual language models (e.g., LLaVA or simil
  • ar)Ability to lead independent research initiati
  • vesExperience in the whole research pipeline from research, experimentation, building & training models, improving performance e


tc.Nice to ha

  • ve:Experience working on structured understanding, document understanding or simi
  • larStrong publications record in top-tier conferences (NeurIPS, ICML, ICLR, CV


PR)This role offers the opportunity to conduct groundbreaking research in multimodal AI understanding. You'll be investigating fundamental questions at the intersection of vision, language, and structured data interpretation, with the goal of achieving significant scientific breakthroug


hs.
The company values scientific innovation and provides an environment conducive to pursuing novel research directions. They offer competitive compensation of Tier 1 AI lab salaries, plus attractive equity opti


ons.
The position is remote in Europe, with some regular travel to meet with the team in P


aris.
If you're excited about conducting pioneering research in computer vision and multimodal AI and want to contribute to the theoretical foundations of next-generation AI systems, this could be your ideal opportunity to make a lasting i


mpact.
Interested in being part of this AI revolution? Apply today. All applicants will receive a re

sponse.

Required profile

Experience

Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Problem Solving

Related jobs