When the PDF circulates internally, it goes viral. Employees from other offices start “Story‑Sprints” of their own, using the template Ari provided. The HR board announces a “Cultural Storytelling Month,” featuring a competition for the best Dengudu Kāthalu adaptation.
| Step | What the script does | Why it matters for your use‑case |
|------|----------------------|---------------------------------|
| 1️⃣ Scan | Recursively walks the given folder, finds every *.pdf. | No matter how deep your “Telugu story” collection is, the tool sees it all. |
| 2️⃣ Metadata extraction | Uses PyPDF2 to read the PDF’s internal title/author fields (if any). | Many PDFs already carry the story name (“Dengudu Kathalu”, “Family Tales”, etc.) – this makes search much richer than just file‑name matching. |
| 3️⃣ Normalisation | Converts every string to lower‑case, strips punctuation, splits on whitespace, and also keeps Telugu Unicode characters (U+0C00‑U+0C7F). | You can search with English or Telugu words without worrying about case or stray symbols. |
| 4️⃣ Indexing | Stores each PDF as a small dictionary of word‑lists (filename, title, author). | The in‑memory index is tiny (a few KB per file) and lets us do lightning‑fast look‑ups. |
| 5️⃣ Query | You type any combination of words – e.g. telugu family dengudu 56 better. The script returns PDFs that contain all those words somewhere in the indexed fields. | Exactly matches your request: find PDFs that talk about Telugu families, the “Dengudu” story, perhaps a 56‑page edition, or anything you deem “better”. |
| 6️⃣ Open / Export | Selecting a result opens it with the default PDF viewer (or you can easily modify the script to copy the path elsewhere). | One‑click access to the story you need, without manually digging through folders. | work+telugu+family+dengudu+kathalu+pdf+56+better
| Channel | Format | Reach | Remarks | |---------|--------|-------|---------| | WhatsApp Broadcast Lists | PDF (compressed) | 8 000+ households | Tag‑based (by district) | | Anganwadi Centres | Printed hard‑copy (A5) | 3 000 families | Laminated for durability | | School Libraries | PDF + audio MP3 | 1 200 students | Integrated into health‑lesson plans | | Community Health Workers (ASHAs/MPWs) | Tablet pre‑load | 5 000 visits | Offline mode | | Local Radio | Story dramatization (5‑min) | Broad audience | Cross‑promotion of PDF download | When the PDF circulates internally, it goes viral