Publications Projects Experience Awards Contact

Su Myat Noe

Ph.D. · Project Researcher, National Institute of Informatics (NII), Tokyo
Multimodal AI Safety · Multilingual LLM Evaluation · Vision-Language Models

"To build AI that is powerful, responsible, and accessible."

S
About

I am a Ph.D.-trained researcher at the National Institute of Informatics (NII), Tokyo, working on multimodal AI safety — specifically how frontier vision-language models (VLMs) behave when safety matters, across languages, modalities, and cultures.

My core project is extending DeepMind's Multimodal Safety Test Suite (MSTS) to Japanese as the 12th language. This is not a translation task — it is a full cross-linguistic safety evaluation involving human-translated and human-edited prompts, model evaluations across both multimodal (image + text) and text-only conditions, and a rigorous human annotation campaign (1,700+ safety judgments). I evaluate five frontier VLMs — including GPT-5, Gemini-2.5-Flash, Qwen2.5-VL, InternVL2.5, and LLM-JP-3-VILA — and find that Japanese inputs consistently produce higher violation rates than English (+10 to +46 percentage points depending on model and condition). This reveals a systematic safety gap that text-only evaluation misses entirely.

On evaluation methodology, I build and validate automated LLM-as-Judge pipelines, comparing them against human judgments through correlation analysis and ablation studies across modality and language conditions. A key finding: text-only reference answers actively hurt automated evaluation when applied to multimodal prompts — because models appropriately incorporate visual context that text-only references don't anticipate. This has direct implications for how we design reward signals for safer model alignment.

Beyond safety evaluation, I contribute to an end-to-end agent evaluation framework for Japanese, localizing benchmarks like OSWorld to assess whether LLM-powered agents can complete real-world tasks — not just call tools. I also study agentic architectures empirically: my ICAART 2026 paper shows that well-designed single-agent systems significantly outperform multi-agent decomposition for vision-based reasoning tasks.

I serve as Session Chair at AAAI and ICAART, Women in AI Myanmar Board Member, and GDG Tokyo volunteer. Trilingual: English · Japanese (JLPT N3) · Burmese.

179Citations
7h-index
16Publications
8Awards & Grants
Publications
2026
NLP 2026
Multimodal Safety Evaluation of Vision-Language Models in Japanese
Su Myat Noe, H. Suzuki, N. Okazaki
The 32nd Annual Conference of the Association for Natural Language Processing · Japan
2026
NLP 2026
AnswerCarefully Dataset Extension: Adding Culturally Sensitive Issues and Multimodal Questions
H. Suzuki, T. Takahashi, Su Myat Noe
The 32nd Annual Conference of the Association for Natural Language Processing · Japan
2026
ICAART 2026
An Empirical Study of Architectural Trade-offs in Vision-based Traffic Sign Interpretation Systems
Su Myat Noe, H. T. Nguyen, M. M. Zin, K. Satoh
18th International Conference on Agents and Artificial Intelligence · Marbella, Spain
2026
LREC 2026
BIS Reasoning 1.0: A Large-scale Japanese Benchmark for Belief-inconsistent Reasoning
H. T. Nguyen et al., Su Myat Noe
Joint International Conference on Computational Linguistics, Language Resources and Evaluation
2025
Scientific Reports
Su Myat Noe et al.
Scientific Reports, 15, 6820 (Nature Portfolio) · IF: 3.8 DOI ↗
2024
ICGEC 2024
From Vision to Vocabulary: A Multimodal Approach to Detect and Track Black Cattle Behaviors
Su Myat Noe et al.
International Conference on Genetic and Evolutionary Computing ★ Best Student Paper Award
2024
NCSP 2024
Multi-Camera System for Black Cattle Tracking and Behavior Detection Using YOLO and SAM
Su Myat Noe et al.
RISP International Workshop on Nonlinear Circuits, Communications and Signal Processing · Hawaii ★ Best Paper Award (Top 1/25)
2023
Sensors
Su Myat Noe et al.
Sensors, 23(1), 532 · MDPI · IF: 3.9 DOI ↗
2021
LifeTech 2021
Deep Learning-based Black Cattle Detection and Tracking Using Multi-Camera System
Su Myat Noe et al.
IEEE International Conference on Consumer Electronics – Taiwan ★ IEEE WIE Best Paper Award

Full list: 4 journal papers (Scopus-indexed) · 12 conference papers · Google Scholar ↗

Selected Projects
🚦 Add screenshot
Agentic AI · ICAART 2026
Traffic Sign Interpretation
Single-agent vs. multi-agent architectures for vision-based traffic sign interpretation. Single-agent significantly outperforms (8.21 vs 7.74, p=0.012).
Experience
Apr 2025 – Present
Project Researcher
National Institute of Informatics (NII) — LLMC · Tokyo, Japan
Multimodal safety evaluation · LLM-as-Judge pipelines · Japanese agent benchmark · VLM evaluation
2018
Junior Data Scientist
Bagan Innovation Technology · Myanmar
Low-resource NLP for Myanmar language
2017
Junior System Developer
Fujitsu Japan · Japan
Enterprise web applications · PHP · JavaScript
Education
2022 – 2025
Ph.D. in Computer Science
University of Miyazaki, Japan
Computer vision · multi-camera tracking · Advisor: Dr. Thi Thi Zin · JST Doctoral Grant · NEC C&C Grant
2020 – 2022
M.Eng. in Energy and Electronics
University of Miyazaki, Japan
Naoji Iwatani Full Master Scholarship Award
2013 – 2018
B.C.Sc. in Computer Science (Knowledge Engineering)
University of Information Technology, Myanmar
Awards & Grants
Encouragement Women Research Award — University of Miyazaki2025
Best Student Paper Award — ICGEC Conference2024
Best Paper Award (Top 1/25) — RISP NCSP Workshop, Hawaii2024
President's Research Funding Award — University of Miyazaki2024–2025
JST Doctoral Researchers Grant2022–2025
NEC C&C Research Grant2022–2023
IEEE Women in Engineering Best Paper Award — LifeTech2021
Naoji Iwatani Full Master Scholarship Award2020–2022
Community & Service
Board Member & Ambassador · 2022 – Present
2022 – Present

I believe access to AI education should not depend on geography, gender, or privilege. Since 2022, I have served as an Ambassador — and now Board Member — of Women in AI Myanmar, helping lead national programs that open doors for students exploring AI for the first time, with a focus on women and underrepresented communities.

In 2025 alone, these programs reached over 900 participants across three initiatives:

153 Responsible AI Workshop
participants
305 ML Summer School applicants
(89 selected for 6-month program)
482 Scholarship knowledge-sharing
session participants

As a trilingual communicator (English · Japanese · Burmese), I also publish AI articles on Medium and speak at events across global communities — translating cutting-edge research into accessible insights for audiences encountering these ideas for the first time.

Workshop Organizer — ASPIRE CV Workshop, University of Oxford & AIST Apr 2026
Session Chair — AAAI & ICAART 2025–2026
Google Developer Group Tokyo — Staff Volunteer 2025–Present
Invited Talks — Google I/O Extended Yangon 2025 · Women in AI ML Summer School 2025 · Sakura Workshop Taiwan 2023 2023–2025
Contact

Open to research collaborations, applied scientist roles, and conversations about multilingual AI safety in the APAC region.