Generic AI sounds like a robot. Cultural AI sounds like a peer. In this lesson, we learn how to build and implement Localized Lingo Datasets to ensure your Desi Content Machine never hits the "Cringe" threshold.
### SYSTEM CONTEXT
Input Dataset: [Attached JSON of 50 real Karachi-tech WhatsApp logs]
Task: Rewrite the following English strategy using the 'Karachi Tech' dialect.
### GUIDELINES
- Use 'Jani' only for personal win-backs.
- Use 'Sahib' for first-time outreach.
- Ensure the English parts remain 'High-Status' (technical and precise).
To avoid the "AI-Urdu" vibe, we maintain a list of AI-isms to avoid (e.g., "Umeed hai ke aap khairiyat se honge"). These phrases are instant signals that the content was generated by a generic model.
Build a JSON dataset of 20 "High-Conversion" Roman Urdu phrases and their English equivalents. For each phrase, define the "Status Level" (1-10) and the "Target Niche."