Full Text
**Explanation of the thought process behind extracting the English text based on the instructions:**
1. **Understand the Goal:** The primary goal is to extract *only* English text, preserving formatting (paragraphs, line breaks, tables) from two distinct sections: the initial title/masthead and the main body. Crucially, the main Hindi body needs to be *skipped*.
2. **Initial Scan for Structure:**
* **Page 1:** Looks like the title/masthead. There's Hindi and English side-by-side or interspersed. The core English elements need to be picked out.
* **Page 1 (lower), Page 2, Page 3:** Predominantly Hindi. A large Hindi table is visible. This is the "Main Hindi Body Section" to be skipped.
* **Page 4:** The Hindi table finishes. Then there's an English file number, and then clear English headings like "MINISTRY OF ROAD TRANSPORT AND HIGHWAYS" and "NOTIFICATION". This marks the start of the "Main English Body Section".
* **Page 5, Page 6, Page 7:** These pages contain the bulk of the English notification and a detailed English table.
3. **Step-by-Step Extraction - Title/Masthead (Page 1):**
* `रजिस्ट्री सं. डी.एल.- 33004/99` and `REGD. No. D. L.-33004/99`: Extract `REGD. No. D. L.-33004/99`.
* `भारत का राजपत्र` and `The Gazette of India`: Extract `The Gazette of India`.
* `सी.जी. डी.एल.-अ.-24042026-272061` and `CG-DL-E-24042026-272061`: Extract `CG-DL-E-24042026-272061`.
* `असाधारण` and `EXTRAORDINARY`: Extract `EXTRAORDINARY`.
* `भाग II- खण्ड 3-उप-खण्ड (ii)` and `PART II—Section 3—Sub-section (ii)`: Extract `PART II—Section 3—Sub-section (ii)`.
* `प्राधिकार से प्रकाशित` and `PUBLISHED BY AUTHORITY`: Extract `PUBLISHED BY AUTHORITY`.
* `सं. 2014]` and `No. 2014]`: Extract `No. 2014]`.
* `नई दिल्ली, शुक्रवार, अप्रैल 24, 2026/वैशाख 4, 1948` and `NEW DELHI, FRIDAY, APRIL 24, 2026/VAISAKHA 4, 1948`: Extract `NEW DELHI, FRIDAY, APRIL 24, 2026/VAISAKHA 4, 1948`.
* At the very bottom of page 1, `2939 GI/2026` appears. This fits the "standalone identifiers" criteria for the initial section. `(1)` is a page number, so it's excluded.
4. **Skipping Hindi Body (Lower Page 1, Page 2, Page 3, Upper Page 4):**
* The text "सड़क परिवहन और राजमार्ग मंत्रालय अधिसूचना..." until the end of the Hindi table on page 4 (which is just above `[फा. सं. RW/JAI/RJ/CE-RO/NH-12012/2025/02/3D]`) is entirely Hindi and is skipped.
* Headers like "2 THE GAZETTE OF INDIA : EXTRAORDINARY [PART II-SEC. 3(ii)]" on page 2, "भारत का राजपत्र : असाधारण 3" on page 3, and "4 THE GAZETTE OF INDIA : EXTRAORDINARY [PART II-SEC. 3(ii)]" on page 4 are navigational headers/footers appearing *before* the main English body starts, but they are *not* part of the initial title/masthead. The instruction says "Exclude repetitive navigational headers or footers... if they appear identically at the top/bottom of *subsequent* pages *after* the main English body has started." This implies that headers *before* the English body are also generally excluded unless they are part of the initial masthead. Given the strong instruction to *skip the Hindi body*, and these headers appearing within/above the Hindi body or as general page headers, they should be excluded. I'll stick to only the clearly identifiable English elements from the *initial title/masthead section* for the first part of the output, and then the *complete main English body text*.
5. **Extraction - Main English Body (Lower Page 4, Page 5, Page 6, Page 7):**
* **Page 4:**
* `[F. No. RW/JAI/RJ/CE-RO/NH-12012/2025/02/3D]` - This is an English file number/identifier, part of the main English content prelude. Include.
* `MINISTRY OF ROAD TRANSPORT AND HIGHWAYS`
* `NOTIFICATION`
* `New Delhi, the 20th April, 2026`
* `S.O. 2087(E).— Whereas by the notification of the Government of India in the Ministry of Road Transport` (careful with the hyphens and line breaks)
* `and Highways, 3286-E Dated:18/07/2025, published in the Gazette of India, Extraordinary, Part II, Section 3, Sub-`
* `section (ii) issued under sub-section (1) of section 3A of the National Highway Act, 1956 (48 of 1956) (hereinafter`
* `referred to as the said Act), the Central Government declared its intention to acquire the land specified in the Schedule`
* **Page 5:**
* Skip navigational header: `[भाग II-खण्ड 3(ii)] भारत का राजपत्र : असाधारण 5`
* `annexed to the said notification for construction of Karauli Bypass, Maintenance, management and operation of NH23`
* `in the stretch of land from Km. 0 to Km. 14 in the district of KARAULI in the state of RAJASTHAN`
* `And whereas the substance of the said notification has been published in "The Indian Express" dated`
* `01/08/2025, “NIRBHIK RAJASTHAN" dated 01/08/2025 ; under sub-section (3) of section 3A of the said Act;`
* `And whereas the Competent Authority has received objections filed under Section 3-C, considered and settled`
* `the same appropriately;`
* `And whereas, in pursuance of sub-section (1) of section 3D of the said Act, the competent authority has`
* `submitted its report to the Central Government;`
* `Now, therefore, upon receipt of the said report of the competent authority and in exercise of the powers`
* `conferred by the sub-section (1) of section 3D of the said Act, the Central Government hereby declares that the land`
* `specified in the said Schedule should be acquired for the aforesaid purpose;`
* `And further, in pursuance of sub-section (2) of section 3D of the said Act, the Central Government hereby`
* `declares that on publication of this notification in the Official Gazette, the land specified in the said Schedule shall vest`
* `absolutely in the Central Government, free from all encumbrances.`
* `SCHEDULE`
* `Brief description of the land to be acquired, with or without structure, falling within the NH23 in the stretch of land`
* `from Km. 0 to Km. 14 in the district of KARAULI in the state of RAJASTHAN`
* `State: RAJASTHAN` and `District: KARAULI` (These look like sub-headers for the table).
6. **Table Formatting and Extraction (Pages 5, 6, 7):** This is the most complex part.
* Carefully map the columns and content.
* Handle multi-line headers.
* Ensure proper use of `|`, `+`, `-`, and spaces for alignment.
* The "Name of Land Owner/Interested Person" column is very wide and contains numbered lists. I need to make sure these lists are preserved within the table cell.
* The table spans multiple pages, so I need to connect the rows correctly.
Let's refine table header row and separators:
The headers 'Sl. No.', 'Survey/Plot Number', 'Area (in Hectares)' are presented in a two-line format in the PDF. I'll represent this explicitly.
```
+------+-------------------+--------------+--------------+------------------+------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
Login to read full text