Ph.D. · Project Researcher, National Institute of Informatics (NII), Tokyo
Multimodal AI Safety · Multilingual LLM Evaluation · Vision-Language Models
"To build AI that is powerful, responsible, and accessible."
I am a Ph.D.-trained researcher at the National Institute of Informatics (NII), Tokyo, working on multimodal AI safety — specifically how frontier vision-language models (VLMs) behave when safety matters, across languages, modalities, and cultures.
My core project extends DeepMind's Multimodal Safety Test Suite (MSTS) to Japanese as its 12th language. This is more than a translation task: it is a full cross-linguistic safety evaluation involving human-translated, human-edited prompts; model evaluations under both multimodal (image + text) and text-only conditions; and a rigorous human annotation campaign of more than 1,700 safety judgments. I evaluate five frontier VLMs (GPT-5, Gemini-2.5-Flash, Qwen2.5-VL, InternVL2.5, and LLM-JP-3-VILA) and find that Japanese inputs consistently produce higher violation rates than English, by 10 to 46 percentage points depending on model and condition. This reveals a systematic safety gap that text-only evaluation misses entirely.
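The per-model gap described above boils down to a simple comparison of violation rates across language conditions. A minimal sketch, with illustrative toy judgments rather than the actual MSTS-JA data:

```python
# Hypothetical sketch: per-model safety gap between Japanese and
# English prompt conditions. The judgment lists below are toy data,
# not real evaluation results.

def violation_rate(judgments):
    """Fraction of responses judged unsafe (True = violation)."""
    return sum(judgments) / len(judgments)

def safety_gap_pp(ja_judgments, en_judgments):
    """Japanese-minus-English violation-rate gap, in percentage points."""
    return 100 * (violation_rate(ja_judgments) - violation_rate(en_judgments))

# Toy per-prompt binary safety judgments for one model and condition.
ja = [True, True, False, True, False, True, True, False, True, True]      # 7/10 violations
en = [True, False, False, False, False, True, False, False, True, False]  # 3/10 violations

print(f"gap: {safety_gap_pp(ja, en):+.0f} pp")  # gap: +40 pp
```

The same computation, run per model and per condition (multimodal vs. text-only), yields the gap figures quoted above.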
On evaluation methodology, I build and validate automated LLM-as-Judge pipelines, comparing them against human judgments through correlation analysis and ablation studies across modality and language conditions. A key finding: text-only reference answers actively hurt automated evaluation when applied to multimodal prompts — because models appropriately incorporate visual context that text-only references don't anticipate. This has direct implications for how we design reward signals for safer model alignment.
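Validating an LLM judge against human annotators comes down to chance-corrected agreement on the same prompts. A minimal sketch using Cohen's kappa on binary safety labels; the labels here are illustrative, not drawn from the actual annotation campaign:

```python
# Hypothetical sketch of judge-vs-human validation: Cohen's kappa
# between an automated LLM judge and human safety labels
# (1 = violation, 0 = safe). All labels below are toy data.

def cohens_kappa(a, b):
    """Chance-corrected agreement between two binary raters."""
    n = len(a)
    po = sum(x == y for x, y in zip(a, b)) / n   # observed agreement
    p_a1 = sum(a) / n                            # rater A's positive rate
    p_b1 = sum(b) / n                            # rater B's positive rate
    pe = p_a1 * p_b1 + (1 - p_a1) * (1 - p_b1)   # agreement expected by chance
    return (po - pe) / (1 - pe)

human = [1, 1, 0, 0, 1, 0, 1, 0, 0, 1]
judge = [1, 1, 0, 0, 1, 0, 0, 0, 1, 1]  # disagrees with humans on two prompts

print(f"kappa = {cohens_kappa(human, judge):.2f}")  # kappa = 0.60
```

Computing kappa separately per modality and language condition is what surfaces effects like the text-only-reference degradation on multimodal prompts.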
Beyond safety evaluation, I contribute to an end-to-end agent evaluation framework for Japanese, localizing benchmarks like OSWorld to assess whether LLM-powered agents can complete real-world tasks — not just call tools. I also study agentic architectures empirically: my ICAART 2026 paper shows that well-designed single-agent systems significantly outperform multi-agent decomposition for vision-based reasoning tasks.
I serve as a Session Chair at AAAI and ICAART, a Board Member of Women in AI Myanmar, and a volunteer with GDG Tokyo. Trilingual: English · Japanese (JLPT N3) · Burmese.
Full list: 4 journal papers (Scopus-indexed) · 12 conference papers · Google Scholar ↗
I believe access to AI education should not depend on geography, gender, or privilege. Since 2022, I have served as an Ambassador — and now Board Member — of Women in AI Myanmar, helping lead national programs that open doors for students exploring AI for the first time, with a focus on women and underrepresented communities.
In 2025 alone, these programs reached over 900 participants across three initiatives.
As a trilingual communicator (English · Japanese · Burmese), I also publish AI articles on Medium and speak at events across global communities — translating cutting-edge research into accessible insights for audiences encountering these ideas for the first time.
Open to research collaborations, applied scientist roles, and conversations about multilingual AI safety in the APAC region.