Şevval Alper
Şevval is an AI researcher at AIMultiple. She has previous research experience in pseudorandom number generation using chaotic systems.
Research interests
Şevval focuses on AI coding tools, AI agents, and quantum technologies.
She is part of the AIMultiple benchmark team, conducting assessments and providing insights to help readers understand various emerging technologies and their applications.
Professional experience
She contributed to organizing and guiding participants in three “CERN International Masterclasses - hands-on particle physics” events in Türkiye, working alongside faculty to facilitate learning.
Education
Şevval holds a Bachelor's degree in Physics from Middle East Technical University.
Latest Articles from Şevval
AI Agents: Operator vs Browser Use vs Project Mariner ['26]
AI agents are increasingly marketed as end-to-end digital workers, but real-world performance can vary widely depending on the task, tools, and execution environment. To understand what these systems can genuinely deliver today, we conducted hands-on benchmarking across practical business scenarios.
Speech-to-Text Benchmark: Deepgram vs. Whisper in 2026
We benchmarked the leading speech-to-text (STT) providers, focusing specifically on healthcare applications. Our benchmark used real-world examples to assess transcription accuracy in medical contexts, where precision is crucial. Speech-to-text benchmark results Based on both word error rate (WER) and character error rate (CER) results, GPT-4o-transcribe demonstrates the highest transcription accuracy among all evaluated speech-to-text systems.
AI Coding Benchmark: Best AI Coders Based on 5 Criteria
Most software engineers rely on AI coding assistants at least once a day in 2025.
Vibe Coding: Great for MVP But Not Ready for Production
Vibe coding is a new term that has entered our lives with AI coding tools like Cursor. It means coding by only prompting. We made several benchmarks to test the vibe coding tools, and with our experience, we decided to prepare this detailed guide.
Screenshot to Code: Lovable vs v0 vs Bolt in 2026
During my 20 years as a software developer, I led many front-end teams in developing pages based on designs that were inspired by screenshots. Designs can be transferred to code using AI tools.
AI Code Review Tools Benchmark in 2026
With the increased use of AI coding tools, codebases have become more prone to vulnerabilities, which increased the need for effective code reviews.
E-Commerce AI Video Maker Benchmark: Veo 3 vs Sora 2
Product visualization plays a crucial role in e-commerce success, yet creating high-quality product videos remains a significant challenge. Recent advancements in AI video generation technology offer promising solutions.
AGI Benchmark: Can AI Generate Economic Value in 2026
AI will have its greatest impact when AI systems start to create economic value autonomously. We benchmarked whether frontier models can generate economic value. We prompted them to build a new digital application (e.g., website or mobile app) that can be monetized with a SaaS or advertising-based model.
Top 4 AI Search Engines Compared in 2026
Searching with LLMs has become a major alternative to Google search. We benchmarked the following AI search engines to see which one provides the most correct results: Benchmark results Deepseek is the leader of this benchmark, by correctly providing 57% of the data in our ground truth dataset.
AIMultiple Newsletter
1 free email per week with the latest B2B tech news & expert insights to accelerate your enterprise.