Ibrahim B.Tech
Prompt Engineering | AI Evaluation | Localization Specialist
WhatsApp | Freelancer
Years of Experience
Satisfied Clients
QA & Evaluation Hours
Completed Projects
I’m a technically inclined freelance professional with 8+ years of experience in translation, annotation, localization, and structured content evaluation across legal, medical, technical, and IT domains. I specialize in AI prompt engineering, response evaluation, and LLM benchmarking, where I review structured AI outputs, validate quality against gold standards, identify hallucinations and logical gaps, and provide actionable feedback to improve model performance. My work involves task categorization by topic, difficulty, and language level, along with detailed error classification and root-cause analysis. With a strong analytical mindset and attention to detail, I’m experienced in working with complex multilingual content, designing reproducible QA workflows, and maintaining high accuracy under strict quality benchmarks.
High-quality multilingual translation with structured QA checks. Strong expertise in terminology validation, intent preservation, and cultural accuracy across legal, medical, technical, and IT content.
Accurate transcription with consistency checks, formatting standards, and speaker validation. Experienced in reviewing AI-assisted transcription outputs.
Designing, testing, and optimizing prompts. Evaluating AI responses for accuracy, hallucinations, clarity, tone, and alignment with expected outputs.
Annotation and labeling for NLP, MT, ASR, and LLM training datasets. Focused on guideline adherence, edge-case detection, and quality consistency.
Comparing model outputs against gold standards and human references. Performing error analysis, root-cause review, and documenting failure patterns.
Supporting AI and localization pipelines using structured documentation, validation scripts, Git-based reviews, and QA tools.