Michael Donoser, Ph.D., Director of Science

About

As a Director of Science for Amazon in Berlin, I lead a cross-functional, centralized research and development team focused on delivering innovative, customer-centric products. Our mission is to turn cutting-edge research into real-world solutions that delight our customers.

As a centralized science team, we have incredible opportunities to identify and develop the most impactful computer vision use cases across Amazon's entire business, allowing us to tackle a diverse set of applications in collaboration with multiple business teams.

With a background spanning academia and industry, I've found my true calling at Amazon. Here, I'm able to channel my passion for science and technology into tangible results, empowering my team to push the boundaries of what's possible. My role is about more than just managing: it's about fostering an environment where brilliant minds can thrive. Seeing their innovations come to life and create value for our customers is what drives me every single day.

🏢

Director of Science at Amazon

10+ years leading CV & AI teams in Germany · 25+ products shipped · multi-billion dollar customer impact

🔬

Research Focus

Generative AI · Computer Vision
Image & Video Understanding and Generation

🧠

Leadership & Coaching

Developing the next generation of AI leaders through coaching in Emotional Intelligence and Mental Models in fast-paced, high-stakes environments.

🎓

Ph.D. in Computer Science

Graz University of Technology, Austria, 2007
Graduated with highest distinction

🏫

M.Sc. Information and Computer Engineering

Graz University of Technology, Austria, 2003
Graduated with highest distinction

Leadership & Coaching

One of my deepest passions is developing the next generation of leaders in fast-paced industry environments, helping brilliant scientists and engineers grow not just as technical experts, but as self-aware, resilient, and impactful human beings.

After more than a decade leading science teams at Amazon, I've come to believe that the biggest leverage point in any organisation isn't the technology: it's the people. Coaching and mentoring is not a side activity for me; it's a core part of how I lead. I invest deeply in building the inner toolkit of my team: the resilience to own outcomes, the clarity to cut through noise, and the self-awareness to stay grounded when everything around them is moving fast.

The rise of Generative AI has made this more urgent, not less. The pace of change is relentless, the stakes are high, and the old playbooks don't always apply. What endures is a strong inner foundation: a growth mindset, sound mental models, honest communication, and the courage to build teams where people can do their best work. These are skills that can be learned and trained, and I'm deeply committed to helping people build them.

🌿

Growth Mindset & Continuous Learning

Intelligence and skills are built, not fixed. I help people embrace discomfort as a signal of growth: to ask the questions they've been afraid to ask, treat failure as data, and stay relentlessly curious in a world where the half-life of knowledge is shrinking fast.

Related writing

Fearing to Look Stupid Holds You Back Sometimes We Win, Sometimes We Learn

🎯

Resilience, Ownership & Accountability

Exceptional leaders refuse the victim mindset. When projects derail, they ask “What can I do right now?” rather than crafting explanations. I coach on reclaiming agency: owning outcomes, not just credit, and choosing usefulness over comfort when it matters most.

Related writing

Take Action Instead of Pointing Fingers Victim Mentality vs. Proactive Attitude

💬

Feedback, Communication & Influence

Honest, direct communication is a force multiplier. I work with leaders on giving and receiving feedback without flinching, managing up effectively, and influencing without formal authority: the skills that determine whether great ideas actually get implemented.

Related writing

How To Communicate to AI The Art of Seeking Feedback

🤝

Building High-Performance Teams

Great teams are built on psychological safety, not just talent. I help leaders spot and dismantle the subtle dysfunctions, like teams that optimise for appearances over outcomes, and create environments where top performers do the best work of their lives.

Related writing

The Potemkin Organization Psychological Safety at Work

🧩

Mental Models & Strategic Clarity

Good decisions come from good thinking frameworks. I help leaders build a toolkit of mental models: first-principles reasoning, inversion, systems thinking, so they can cut through noise, know when to persist and when to stop, and reason from fundamentals rather than analogy.

Related writing

The Secret to Being Right A Lot The Power of Saying No

⚡

Leading in the Age of Generative AI

The old playbooks don't fully apply anymore. I share a hands-on perspective on how AI is reshaping leadership: from using it as a judgment-free thought partner and learning environment, to prototyping ideas in minutes, to re-thinking what it means to create impact in AI-native organisations.

Related writing

From Idea to Working App in 30 Minutes How AI Is Changing My Day-to-Day Work

Career

2023 – Present Industry

Director of Science

Amazon Development Center Germany · Berlin, Germany

Leading a cross-functional Computer Vision & AI innovation organization across multiple German sites. Defining long-term CV & AI strategy with direct reporting to Retail VP. Key production launches include image & video generation, visual recommendations, image translation, shoppable images, image & video moderation, virtual try-on applications, visual copyright infringement detection, video maturity recognition, and accessibility features on Echo Screen Devices.

2018 – 2023 Industry

Senior Applied Scientist Manager

Amazon Development Center Germany · Berlin, Germany

Grew and led the Applied Science team, fostering research-to-production pipelines for computer vision applications across multiple Amazon product lines.

2014 – 2018 Industry

Applied Scientist Manager

Amazon Development Center Germany · Berlin, Germany

Founded and built the computer vision science team in Berlin, Germany, establishing the infrastructure and culture for research-driven product innovation at Amazon Berlin.

2013 – 2014 Academia

Senior Researcher (Tenure Track)

Institute for Computer Graphics and Vision · Graz University of Technology, Austria

Research on image segmentation, shape matching, and retrieval. Supervision of PhD students and international research collaborations.

2008 – 2012 Academia

Assistant Professor · Leader, Virtual Habitat Group

Institute for Computer Graphics and Vision · Graz University of Technology, Austria

Led the Virtual Habitat research group. Supervised eight PhD students.

2003 – 2008 Academia

Research Associate · PostDoc · Ph.D. Candidate

Institute for Computer Graphics and Vision · Graz University of Technology, Austria

Thesis:"Advanced Segmentation and Tracking Algorithms and Their Application to 3D Paper Structure Analysis". Graduated with highest distinction.

1996 – 2003 Academia

M.Sc. in Telematik (Information and Computer Engineering)

Institute for Computer Graphics and Vision · Graz University of Technology, Austria

Thesis: "Object Segmentation in Videos". Graduated with highest distinction.

Recognition & Awards

2008

🎍

Josef-Krainer Förderungspreis

Awarded by the State Government of Styria (Austria) for outstanding early-career scientific achievement.

Link to Article (German) →

2008

🥇

Best Scientific Paper Award

International Conference on Pattern Recognition (ICPR), "Using Web Search Engines to Improve Text Recognition"

View award photo →

2009

🥇

Best Scientific Paper Award

OAGM/AAPR Workshop, "Finding Stable Extremal Region Boundaries"

2010

🥇

Best Scientific Paper Award

OAGM/AAPR Workshop, "MSER Templates for 3D Pose Tracking"

2012

🥇

Best Scientific Paper Award

Asian Conference on Computer Vision (ACCV): "Detecting Partially Occluded Objects with an Implicit Shape Model Random Field"

Writing

Selected posts on leadership, learning, and the age of Generative AI. Published on LinkedIn and Medium.

Leadership & Identity

6 min read

I am a Director of Science at Amazon. I've Been Winging It My Whole Life.

A candid reflection on imposter syndrome, self-doubt, and the messiness behind a career that looks polished from the outside.

Generative AI

11 min read

From C64 to Claude: How AI Finally Let Me Build My Dream Game

A childhood dream deferred by programming complexity, finally realised with AI. On creativity, persistence, and what it means to build something truly your own.

Growth Mindset

Fearing to Look Stupid Holds You Back

AI creates the first truly judgment-free learning environment in human history. The only barrier to growth is giving yourself permission to ask.

Resilience & Ownership

Take Action Instead of Pointing Fingers

When projects derail, leaders reveal their true nature. Exceptional ones refuse the victim mindset and ask: what can I do right now?

Communication & Influence

How To Communicate to AI

The same principles that make us effective communicators with humans apply when prompting AI. Mastering one may sharpen the other.

Building Teams

Kaizen Town Halls: Building a Culture of Continuous Improvement

A regular forum where every team member has a voice. How Kaizen Town Halls build psychological safety and turn tensions into rapid, meaningful change.

Building Teams

The Potemkin Organization

When teams optimise for how they appear to leadership rather than real outcomes, the facade can look strong right up until it suddenly isn't.

Mental Models

The Secret to Being Right A Lot

Great decisions don't come from experience alone. They come from building a diverse toolkit of thinking frameworks that cut through noise and bias.

Generative AI

From Idea to Working App in 30 Minutes

When you can build your own tools in less than a lunch break, you move from passive observer to proactive creator. The only remaining obstacle is curiosity.

Publications

Over 50 accepted, peer-reviewed publications spanning computer vision, machine learning, and artificial intelligence.
Full record on Google Scholar.

Google Scholar Impact

3,221 Citations 891 since 2021

27 h-index 13 since 2021

49 i10-index 19 since 2021

View full profile on Google Scholar →

Published at the World's Top Computer Vision Conferences

26 papers accepted at the premier "Big Five" CV venues — among the most selective in all of computer science.

CVPR 9 papers

Conference on Computer Vision & Pattern Recognition

Ranked #1 globally among all CS conferences · ~22% acceptance rate · 13,000+ annual submissions

ICCV 3 papers

International Conference on Computer Vision

Biennial IEEE flagship of computer vision · ~25% acceptance rate · 8,000+ submissions

ECCV 2 papers

European Conference on Computer Vision

Co-flagship with ICCV · ~28% acceptance rate · 8,500+ submissions

ACCV 4 papers

Asian Conference on Computer Vision

🏆 Best Paper Award 2012

Premier Asian venue · rigorous peer review

BMVC 8 papers

British Machine Vision Conference

Leading European specialty venue · h5-index 65 · rigorous peer review

OAGM 2 papers

Austrian Workshop on Computer Vision

🏆 2× Best Paper Award

Largest CV conference in Austria · running since 1981 · 43+ editions

ICPR 12 papers

International Conference on Pattern Recognition

🏆 Best Paper Award 2008

IEEE’s flagship pattern recognition venue · biennial · running since 1973

Selected Publications

Highlights from 50+ papers — top-cited works & award winners.

2021 CVPR Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning

Cross-modal recipe retrieval — matching food images to recipes and vice versa — is key for intelligent food discovery. This paper introduces a hierarchical recipe Transformer that attentively encodes individual recipe components (titles, ingredients, instructions), paired with a self-supervised loss that leverages semantic relationships within recipes. The approach enables training on both image-recipe and recipe-only samples, achieving state-of-the-art results on the Recipe1M dataset. Code and models released publicly via Amazon’s GitHub.

2021 ICCV Learning Attribute-driven Disentangled Representations for Interactive Fashion Retrieval

Interactive fashion retrieval lets users refine image search by modifying specific visual attributes such as color or sleeve type. Existing methods learn entangled embedding spaces, causing unintended changes when a single attribute is manipulated. This paper trains convolutional networks to learn separate, attribute-specific subspaces so that swapping one attribute leaves others unaffected. The unified model handles attribute manipulation retrieval, conditional similarity retrieval, and outfit complementary item retrieval, achieving state-of-the-art performance across all three tasks.

2014 CVPR Discrete-Continuous Gradient Orientation Estimation for Faster Image Segmentation

State-of-the-art image segmentation relies on hierarchical structures built from local feature cues analyzed via spectral methods, but both steps remain computational bottlenecks. This paper shows that a discrete-continuous optimization of oriented gradient signals delivers segmentation performance competitive with the state-of-the-art on the BSDS 500 benchmark — without any spectral analysis — reducing computation time by a factor of 40 and memory requirements by a factor of 10.

2014 CVPR Discriminative Feature-to-Point Matching in Image-Based Localization

Image-based localization matches query image interest points to a sparse 3D point cloud to recover camera pose. Standard approaches represent each 3D point by a single descriptor centroid, ignoring the full set of associated 2D descriptors. This paper reformulates matching as a discriminative classification problem using random ferns, which is memory- and runtime-efficient. An extension projects features into fern-specific embedding spaces, improving match rates and outperforming nearest-neighbor baselines.

2013 CVPR Diffusion Processes for Retrieval Revisited

Diffusion on affinity graphs propagates similarity information across a manifold to improve retrieval ranking. This paper provides a comprehensive revisit of diffusion-based retrieval, surveys the state-of-the-art, and derives a unified generic framework of which prior methods are special cases. Evaluating the framework across multiple retrieval tasks yields new algorithm variants; one achieves a perfect 100% bullseye score on the widely used MPEG-7 shape retrieval benchmark.

2012 ACCV Detecting Partially Occluded Objects with an Implicit Shape Model Random Field 🏆 Best Paper Award

This paper extends the Implicit Shape Model for object detection into a probabilistic random field formulation that naturally handles partial occlusion. A sparse graph encodes patch-to-instance voting in a semantically meaningful label space. A novel inference procedure efficiently minimizes the resulting energy without a fixed non-maximum suppression radius, cleanly separating even strongly overlapping object instances.

2010 OAGM MSER Templates for 3D Pose Tracking 🏆 Best Paper Award

Closed MSER contours detected on a reference object form stable, perspective-covariant templates. A classifier trained on these regions enables robust, frame-to-frame tracking, with 3D pose recovered via perspective-n-point from matched regions. The approach leverages MSER’s efficiency and robustness to illumination change, enabling real-time pose estimation on textured planar and near-planar targets.

2010 ECCV Using Partial Edge Contour Matches for Efficient Object Category Localization

Object category localization is cast as a partial edge-contour matching problem against a single shape prototype, avoiding error-prone pre-processing such as contour decomposition or interest-point detection. All extracted edges participate in a partial contour matching step; matched fragments vote for location hypotheses via generalized Hough-style accumulation. The method yields competitive results on challenging benchmarks including ETHZ shapes and INRIA horses at low computational cost.

2009 OAGM Finding Stable Extremal Region Boundaries 🏆 Best Paper Award

Rather than responding to local brightness discontinuities like Canny-style detectors, this method detects edges by identifying boundaries of maximally stable image regions. A component tree is built by thresholding the image at all levels; nodes at adjacent levels are compared to find region boundaries that remain geometrically consistent across a range of thresholds. The result is a fast, mid-level edge detector that captures perceptually meaningful boundaries.

2009 ICCV Saliency Driven Total Variation Segmentation

Affinity propagation clustering on local color and texture models identifies multiple salient image regions. Each salient region seeds a figure/ground segmentation obtained by minimizing a convex weighted total variation energy, guaranteeing a globally optimal binary solution per seed. The resulting redundant segmentations are merged into a single composite by analyzing local certainty across all solutions, producing coherent multi-region outputs without user initialization.

2008 ICPR Using Web Search Engines to Improve Text Recognition 🏆 Best Paper Award

Initial character recognition via MSER-based text detection yields noisy per-character hypotheses. This paper exploits web search engines at two levels of contextual granularity to re-rank and correct initial outputs: word-level queries filter implausible character sequences, and phrase-level queries leverage web co-occurrence statistics. Even a low-quality base recognizer achieves substantially improved accuracy when augmented with this web-based contextual post-processing.

2006 CVPR Efficient Maximally Stable Extremal Region (MSER) Tracking

MSERs are powerful local feature detectors but are expensive to recompute from scratch each frame. This paper exploits the component tree underlying MSER detection to propagate and update region identities across video frames, reducing per-MSER computation by a factor of 4–10. A weighted feature vector and backward-tracking pass improve data association and robustness, demonstrated on license plate, face, and paper-fiber tracking with consistent speedups over frame-by-frame redetection.

MichaelDonoser

About

Director of Science at Amazon

Research Focus

Leadership & Coaching

Ph.D. in Computer Science

M.Sc. Information and Computer Engineering

Leadership & Coaching

Growth Mindset & Continuous Learning

Resilience, Ownership & Accountability

Feedback, Communication & Influence

Building High-Performance Teams

Mental Models & Strategic Clarity

Leading in the Age of Generative AI

Career

Director of Science

Senior Applied Scientist Manager

Applied Scientist Manager

Senior Researcher (Tenure Track)

Assistant Professor · Leader, Virtual Habitat Group

Research Associate · PostDoc · Ph.D. Candidate

M.Sc. in Telematik (Information and Computer Engineering)

Recognition & Awards

Josef-Krainer Förderungspreis

Best Scientific Paper Award

Best Scientific Paper Award

Best Scientific Paper Award

Best Scientific Paper Award

Writing

I am a Director of Science at Amazon. I've Been Winging It My Whole Life.

From C64 to Claude: How AI Finally Let Me Build My Dream Game

Fearing to Look Stupid Holds You Back

Take Action Instead of Pointing Fingers

How To Communicate to AI

Kaizen Town Halls: Building a Culture of Continuous Improvement

The Potemkin Organization

The Secret to Being Right A Lot

From Idea to Working App in 30 Minutes

Publications

Published at the World's Top Computer Vision Conferences

Selected Publications

Personal Projects

adventure-game

rsvp-reader

learn-japanese

guitar-scale-trainer

task-manager

Find Me Online

Michael
Donoser