**User's Objective:** Extract specific English text elements from the initial title/masthead section and the *complete* main English body text from a bilingual Indian Gazette PDF, skipping the main Hindi body text. Preserve formatting accurately, including table formatting.
**OCR Output Analysis & Plan:**
The OCR output provides the text from 24 pages. I will go through it step-by-step following the instructions.
**1. Initial Title/Masthead Extraction (Page 1):**
* "REGD. No. D. L.-33004/99" - English, include.
* "The Gazette of India" - English, include.
* "CG-UP-E-20012026-269451" - English, include.
* "EXTRAORDINARY" - English, include.
* "PART II-Section 3-Sub-section (ii)" - English, include.
* "PUBLISHED BY AUTHORITY" - English, include.
* "No. 223]" - English, include.
* "NEW DELHI, FRIDAY, JANUARY 16, 2026/PAUSHA 26, 1947" - English, include.
All other text on Page 1 before the "MINISTRY OF RAILWAYS" section is Hindi and will be skipped.
**2. Skip Hindi Body (Pages 1-14):**
* The OCR output clearly shows a large block of Hindi text and a Hindi table from Page 1 (after the masthead) up to Page 14. This will be completely skipped.
* Repetitive headers like "THE GAZETTE OF INDIA : EXTRAORDINARY" and "[PART II-SEC. 3(ii)]" on pages 2, 4, etc. are navigational and will be skipped as per instruction #5.
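The Hindi-skipping step above can be sketched as a per-line script check. This is a minimal illustration, not the method actually used on the OCR output: the function names `is_hindi_line` and `keep_english` and the 0.5 threshold are assumptions introduced here, and the check relies only on the Devanagari Unicode block (U+0900 to U+097F).

```python
def is_hindi_line(line: str, threshold: float = 0.5) -> bool:
    """Return True when Devanagari characters dominate the line.

    Only Devanagari code points and ASCII letters are counted, so digits
    and punctuation (e.g. "No. 223]") do not influence the decision.
    The threshold is an illustrative assumption.
    """
    devanagari = sum(1 for ch in line if "\u0900" <= ch <= "\u097f")
    latin = sum(1 for ch in line if ch.isascii() and ch.isalpha())
    total = devanagari + latin
    return total > 0 and devanagari / total >= threshold


def keep_english(lines):
    """Drop lines classified as Hindi, preserving the order of the rest."""
    return [line for line in lines if not is_hindi_line(line)]


sample = ["रेल मंत्रालय", "MINISTRY OF RAILWAYS", "अधिसूचना", "NOTIFICATION"]
print(keep_english(sample))
```

A line-level filter like this treats romanised proper nouns (village names, newspaper titles) as English, which matches the language-handling rule in this plan.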
**3. Extract English Body (Page 14 onwards):**
* On Page 14, the English body starts with:
"[F. No. NCR-HQ0CENG(PH 15)/5/2021- O/O Dy CE/C/ALD (Computer No.51223) Dated: 24.11.2021]"
"HIMANSHU GOSWAMI, Chief Administrative Officer (Construction)"
"MINISTRY OF RAILWAYS"
"[North Central Railway (Construction Department)]"
"NOTIFICATION"
"New Delhi, the 9th January, 2026"
"S.O. 246(E).—Whereas, by the notification of the Government of India, in the Ministry of Railways (North Central Railway, Construction department), Notification number, S.O. 4777 (E) dated the 22nd October, 2025 published in Gazette of India, Extraordinary, Part II, section 3 Sub-section (ii), dated the 08th October, 2025 (hereinafter referred to as the said notification) and issued under sub-section (1) of section 20A of the Railways (Amendment) Act, 2008 (11 of 2008) (hereinafter referred to as the said Act), the Central Government declares its"
* From Page 15 onwards the content is entirely English body text, consisting of:
* The introductory paragraphs.
* The "SCHEDULE" heading.
* The "Brief description..." paragraph.
* **The Table:** This is the most complex part. I need to parse the columns and reconstruct it using `|`, `+`, `-` as specified. The table spans multiple pages (15-23).
**4. Language Handling within English Body:**
* As seen in the sample, proper nouns such as "Dainik Jagran", "The Times of India", "Mirzapur", and "Chunar Tehsil" appear in Latin script within the English text and will be included.
* Numerals and parenthetical citations like "(11 of 2008)" will be included.
**5. Header/Footer Handling:**
* Initial English masthead on page 1 is included.
* Repetitive headers ("THE GAZETTE OF INDIA : EXTRAORDINARY", "[PART II-SEC. 3(ii)]") on subsequent pages (2, 4, 6, 8, 10, 12, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24) will be *excluded*.
* The footer on page 24, "Uploaded by Dte. of Printing at Government of India Press, Ring Road, Mayapuri, New Delhi-110064 and Published by the Controller of Publications, Delhi-110054." is part of the document's official information and will be included. The signature line "GORAKHA NATH YADAVA Digitally signed by GORAKHA NATH YADAVA Date: 2026.01.20 12:26:48 +05'30'" also appears to be part of the official record and will be included.
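The header rule above (keep the page 1 masthead, drop the running headers on later pages) could be implemented roughly as follows. This is a sketch under assumptions: the OCR output is taken to be already split into a list of pages, each a list of lines, and each running header is assumed to sit alone on its line. The name `strip_running_headers` is hypothetical.

```python
import re

# Running headers quoted in this plan; each is assumed to occupy a whole line.
RUNNING_HEADER_PATTERNS = [
    re.compile(r"THE GAZETTE OF INDIA\s*:\s*EXTRAORDINARY"),
    re.compile(r"\[PART II-SEC\. 3\(ii\)\]"),
]


def strip_running_headers(pages):
    """Remove running headers from every page except the first.

    `pages` is a list of pages, each page a list of text lines. Page 1 is
    returned unchanged so the masthead survives.
    """
    cleaned = []
    for index, lines in enumerate(pages):
        if index == 0:
            cleaned.append(list(lines))
            continue
        cleaned.append([
            line for line in lines
            if not any(p.fullmatch(line.strip()) for p in RUNNING_HEADER_PATTERNS)
        ])
    return cleaned
```

Matching whole lines only keeps body text that merely mentions the Gazette, such as the notification paragraph, from being removed by accident.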
**Table Reconstruction Strategy:**
The table has 7 columns: S.N., Village Name, Plot No., Type of Land, Nature of Land, Acquired Area (in Hect.), Land Owner / Beneficiary Name.
The last column "Land Owner / Beneficiary Name" takes up a lot of vertical space due to multiple names per row. I will need to ensure proper alignment and column separation.
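One way to handle the tall "Land Owner / Beneficiary Name" cell is to let each cell hold several lines and pad the shorter cells in the same row. The sketch below is illustrative only: `render_table` is a hypothetical helper, and it assumes header labels fit on one line and that multi-line cells use `\n` as the separator.

```python
def render_table(headers, rows):
    """Render an ASCII grid using '|', '+' and '-'.

    Each cell is a string that may contain newlines; a row is as tall as
    its tallest cell, and shorter cells are padded with blank lines.
    """
    split_rows = [[str(cell).split("\n") for cell in row] for row in rows]

    # Column width = longest single line in the header or any cell.
    widths = [len(h) for h in headers]
    for row in split_rows:
        for i, cell_lines in enumerate(row):
            widths[i] = max(widths[i], *(len(s) for s in cell_lines))

    border = "+" + "+".join("-" * (w + 2) for w in widths) + "+"

    def fmt(cells):
        return "| " + " | ".join(c.ljust(w) for c, w in zip(cells, widths)) + " |"

    out = [border, fmt(headers), border]
    for row in split_rows:
        height = max(len(cell_lines) for cell_lines in row)
        for k in range(height):
            out.append(fmt([c[k] if k < len(c) else "" for c in row]))
        out.append(border)
    return "\n".join(out)


print(render_table(
    ["S.N.", "Plot No.", "Land Owner / Beneficiary Name"],
    [["1", "104", "Savita Devi W/o Ganesh\nMahesh Kumar S/o Budhu"]],
))
```

Putting one name per line inside the cell keeps every physical line the same width, so the column separators stay aligned however many owners a plot has.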
Let's start extracting.
---
**Detailed Extraction Steps for the Table:**
The table starts on page 15 and continues through page 23. I'll need to combine entries from these pages.
The column headers are:
S.N. | Village Name | Plot No. | Type of Land | Nature of Land | Acquired Area (in Hect.) | Land Owner / Beneficiary Name
I will reconstruct the table row by row, ensuring all names under "Land Owner / Beneficiary Name" are included for each S.N./Plot No. entry.
**Page 15 Table:**
* **Row 1:**
* S.N.: 1
* Village Name: Pirallipur
* Plot No.: 104
* Type of Land: Private
* Nature of Land: Agriculture
* Acquired Area (in Hect.): 0.0570
* Land Owner / Beneficiary Name:
* Golu Minor 11 years S/o Ganesh C/o Mother Savita
* Abha Minor 9 years D/o Ganesh C/o Mother Savita Devi
* Savita Devi W/o Ganesh
* Mahesh Kumar S/o Budhu
* Urmila Devi W/o Budhu
* Seema Singh W/o Vrajanand
* Pradeep Kumar S/o Dhawal Singh
* Ravindra Kumar S/o Hari Pratap Singh
* Parasnath S/o Ramji Yadav
* Rajnarayan S/o Ramji Yadav
* Lal Bahadur S/o Birbal
* Geeta Devi W/o Shyam Sundar
* Raj Kumar S/o Birbal
* Manju W/o Bal Kishun
* Sukhiya Devi W/o Chhotey Lal
* Sitabi Devi W/o Hira Lal
* Ganga Ram S/o Badri
* Dhanesara Devi W/o Badri
* **Row 2 (partially visible on page 15):**
* S.N.: (empty)
* Village Name: (empty)
* Plot No.: 108
* Type of Land: Private
* Nature of Land: Agriculture
* Acquired Area (in Hect.): 0.0060
* Land Owner / Beneficiary Name:
* Chhote Lal S/o Dangar
* Bhaggu S/o Ramman
* Jaggu S/o Ramman
* Kanhaiya Lal S/o Jairam
* Girdhari Lal S/o Jairam
**Page 16 Table:**
* **Row 2 (continuation from page 15):**
* Land Owner / Beneficiary Name (continued):
* Nand Kishore S/o Jairam
* **Row 3:**
* S.N.: 2
* Village Name: Dhaorahara
* Plot No.: 184
* Type of Land: Private
* Nature of Land: Agriculture
* Acquired Area (in Hect.): 0.0640
* Land Owner / Beneficiary Name:
* Ramesh Kumar S/o Samarbahadur
* Rajesh Kumar S/o Indrabahadur
* Sushila Devi W/o Ramesh Kumar
* Mamta Devi W/o Satyendra Kumar
* Satyendra Kumar S/o Samar Bahadur
* Uday Pratap S/o Ratnesh Kumar
* Shashi Kumar S/o Ratnesh Kumar
* Sandhya Singh W/o Ratnesh Kumar
* Rajaram S/o Munna Lal
* Gulab Chandra S/o Munna Lal
* Panna Lal S/o Munna Lal
* Rajesh Kumar S/o Munna Lal
* Rajendra Kumar S/o Jitendra Bahadur
* Shail Kumari W/o Jitendra Bahadur
* Kanhaiya Lal S/o Nauki Lal
* Amrit Lal S/o Lallu
* Jhulni urf Kalawati W/o Kallu
* Rambali S/o Lallu
* Smriti urf Simrit Lal S/o Lallu
* **Row 4:**
* S.N.: 3
* Village Name: Pachewara
* Plot No.: 504
* Type of Land: Private
* Nature of Land: Agriculture
* Acquired Area (in Hect.): 0.0280
* Land Owner / Beneficiary Name:
* Krishan Murari S/o Jagdish Prasad
* Munib Yadav S/o Sachan Yadav
* Bansh Narayan S/o Sachan Yadav
* Ekta Singh W/o Sankalp Singh
* Urmila Devi W/o Chandrashekhar Singh
* Neeta Yadav W/o Shyamu Yadav
* **Row 5:**
* S.N.: (empty)
* Village Name: (empty)
* Plot No.: 505
* Type of Land: Private
* Nature of Land: Agriculture
* Acquired Area (in Hect.): 0.0230
* Land Owner / Beneficiary Name:
* Krishan Murari S/o Jagdish Prasad
* Munib Yadav S/o Sachan Yadav
* Bansh Narayan S/o Sachan Yadav
* Ekta Singh W/o Sankalp Singh
* Urmila Devi W/o Chandrashekhar Singh
* Neeta Yadav W/o Shyamu Yadav
* Gayatri Devi W/o Sohan Singh
* Shamsher Bahadur S/o Sohan Singh
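The page-by-page entries above still need to be stitched together: the Plot No. 108 row continues from page 15 onto page 16, and rows with an empty S.N. and Village Name belong to the village named in an earlier row. A minimal sketch of that merge follows. The dictionary keys, the `village_filled` field and the function name `merge_fragments` are assumptions made for illustration; a fragment with no plot number is treated as a continuation of the previous row.

```python
def merge_fragments(fragments):
    """Combine per-page table fragments into complete rows.

    A fragment without a plot number extends the owner list of the row
    before it. A 'village_filled' key records the village a row belongs
    to, while the original (possibly empty) 'village' cell is kept so the
    printed layout can still be reproduced.
    """
    rows = []
    for frag in fragments:
        if not frag.get("plot"):
            if not rows:
                raise ValueError("continuation fragment with no preceding row")
            rows[-1]["owners"].extend(frag["owners"])
            continue
        row = dict(frag)
        row["owners"] = list(frag["owners"])  # copy so inputs are not mutated
        previous = rows[-1]["village_filled"] if rows else ""
        row["village_filled"] = row.get("village") or previous
        rows.append(row)
    return rows


fragments = [
    {"sn": "1", "village": "Pirallipur", "plot": "104",
     "owners": ["Savita Devi W/o Ganesh"]},
    {"sn": "", "village": "", "plot": "108",
     "owners": ["Chhote Lal S/o Dangar", "Girdhari Lal S/o Jairam"]},
    {"sn": "", "village": "", "plot": "",           # page 16 continuation
     "owners": ["Nand Kishore S/o Jairam"]},
    {"sn": "2", "village": "Dhaorahara", "plot": "184",
     "owners": ["Ramesh Kumar S/o Samarbahadur"]},
]
for row in merge_fragments(fragments):
    print(row["village_filled"], row["plot"], len(row["owners"]))
```

The owner lists in this usage example are shortened; the full lists are the ones given in the page-by-page entries above.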