Develop a comprehensive web application designed for the extraction and verification of information from semi-structured data, specifically tailored for the DRDO interview process. The application should feature a multi-menu structure, ensuring all functionalities are operational and presented in an attractive and user-friendly interface.
Ensure the application is scalable and robust, capable of handling a large volume of user data and document uploads. The design should prioritize user experience, with clear instructions, intuitive navigation, and responsive design for optimal viewing across various devices. The project should be substantial in scope, reflecting the complexity of the data extraction and verification processes required for the DRDO interview process.
I'll create a comprehensive web application for the DRDO interview process with authentication, document uploads, and verification features.
This comprehensive web application for the DRDO interview process includes:
Authentication System:
Dashboard:
Multi-step Form:
Document Verification:
Conditional Logic:
The application is built with a responsive design that works across all device sizes and follows accessibility best practices. The UI is clean and professional, making it easy for users to navigate through the complex application process.
To use this application:
npm run dev
In the review and submit area, the personal information section doesn't show the details the user entered; it shows fixed details. It should show what the user entered on the personal information page. Also, the document upload page should check whether the user is uploading the correct document: if the requested document is a caste certificate but the user submits a resume or any other document, it should not be accepted. Also add Aadhaar submission to the document page. The main theme of the project is to verify the submitted documents by comparing them with the user-entered details. I have the code for text extraction:
# Import the OCR tamil package
from ocr_tamil.ocr import OCR
import torch
from ocr_tamil.strhub.models.parseq.system import PARSeq
from ocr_tamil.strhub.models.parseq.modules import Encoder
torch.serialization.add_safe_globals([PARSeq, Encoder])
ocr = OCR(detect=True)
texts = ocr.predict('obc_testing_img.jpg')
print(texts[0])
I'll enhance the application with document verification functionality and fix the review page to show actual user-entered details.
I've enhanced the DRDO interview portal with the requested improvements:
Dynamic Review Page:
Document Verification System:
Aadhaar Card Requirement:
Document Verification Service:
The application now properly verifies documents by comparing the content with expected keywords and patterns. While this implementation simulates the OCR functionality in the frontend, in a real application, you would connect to a backend service that uses the Python OCR code you provided.
The verification system supports multiple languages as mentioned in your requirements, including Tamil and Hindi, by simulating the extraction of text in those languages based on the document type.
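The keyword-and-pattern comparison described above could look roughly like the following on the backend. This is a minimal sketch, not the portal's actual code: the `DOCUMENT_RULES` table, the keyword lists, the `matches_document_type` helper, and the `min_keyword_hits` threshold are all hypothetical and would need tuning against real certificates.

```python
import re

# Hypothetical rules per document type (illustrative keywords only).
DOCUMENT_RULES = {
    "caste_certificate": {
        "keywords": ["caste", "community", "certificate"],
        "patterns": [],
    },
    "aadhaar": {
        "keywords": ["aadhaar", "government of india"],
        # A 12-digit number, optionally printed in groups of four.
        "patterns": [r"\b\d{4}\s?\d{4}\s?\d{4}\b"],
    },
}


def matches_document_type(text, doc_type, min_keyword_hits=2):
    """Return True if OCR text looks like the requested document type.

    A document passes when at least `min_keyword_hits` keywords occur in the
    text and every required pattern is found.
    """
    rules = DOCUMENT_RULES[doc_type]
    lowered = text.lower()
    hits = sum(1 for keyword in rules["keywords"] if keyword in lowered)
    patterns_ok = all(re.search(p, text) for p in rules["patterns"])
    return hits >= min_keyword_hits and patterns_ok


if __name__ == "__main__":
    resume_text = "Resume: work experience, education, skills"
    # A resume uploaded in the caste certificate slot should be rejected.
    print(matches_document_type(resume_text, "caste_certificate"))
```

A plain keyword count is easy to explain to users when an upload is rejected, but it is only a first filter; it cannot tell a genuine certificate from any page that happens to contain the same words.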
Use this text extraction:
import pytesseract
from pdf2image import convert_from_path
import re

def extract_english_text(pdf_path):
    # Convert PDF pages to images
    images = convert_from_path(pdf_path)
    all_text = ""
    for image in images:
        # Perform OCR on the image
        text = pytesseract.image_to_string(image)
        # Filter only English text using regex
        english_text = re.sub(r'[^A-Za-z0-9.,!?;:\'/"\"()\s]', '', text)
        all_text += english_text + "\n"
    return all_text
Verify the user-entered details against the text extracted by this code. Also add more CSS, make the app more attractive and functional, and add some images; don't leave any areas blank.
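One way to approach the comparison step is sketched below. It assumes the extracted text is the string returned by `extract_english_text`; the helpers `normalize`, `best_match_ratio`, and `verify_fields`, and the 0.85 threshold, are hypothetical names and values chosen for illustration. Fuzzy matching from the standard library's `difflib` is used here because OCR output often contains small character errors, so an exact substring test would reject many valid documents.

```python
import re
from difflib import SequenceMatcher


def normalize(value):
    """Lowercase and collapse every run of non-alphanumerics to one space."""
    return re.sub(r"[^a-z0-9]+", " ", value.lower()).strip()


def best_match_ratio(value, text):
    """Best similarity between `value` and any same-length word window of `text`."""
    target = normalize(value)
    words = normalize(text).split()
    if not target or not words:
        return 0.0
    size = len(target.split())
    best = 0.0
    # Slide a window with as many words as the target across the text.
    for start in range(max(1, len(words) - size + 1)):
        window = " ".join(words[start:start + size])
        best = max(best, SequenceMatcher(None, target, window).ratio())
    return best


def verify_fields(form_data, extracted_text, threshold=0.85):
    """Map each form field to True if a close match appears in the OCR text."""
    return {
        field: best_match_ratio(value, extracted_text) >= threshold
        for field, value in form_data.items()
    }


if __name__ == "__main__":
    # Sample values are made up for demonstration.
    form = {"name": "Ravi Kumar", "date_of_birth": "12/05/1998"}
    ocr_text = "Name: RAVI KUMAR\nDate of Birth: 12/05/1998"
    print(verify_fields(form, ocr_text))
```

The per-field result lets the review page flag exactly which entries disagree with the uploaded document instead of rejecting the whole upload.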