Back to Projects

      Lead Engineer

      ID Scanner Application

      A mobile application designed to scan identity documents and extract structured data entirely on-device, prioritizing privacy, performance, and reliability.

      React NativeVision CameraGoogle ML KitComputer VisionMobile

      Problem

      Real-world ID scanning involves poor lighting, motion blur, damaged documents, and inconsistent layouts. Cloud-based OCR introduces latency and privacy concerns, especially in regulated environments.

      Solution

      Built a fully on-device processing pipeline that detects documents, guides users during capture, and extracts structured data using optimized computer vision and OCR techniques.

      Architecture & Approach

      React Native application using react-native-vision-camera with custom frame processors. Google ML Kit is used for OCR and face detection. All processing runs locally without requiring cloud connectivity.

      Key Tradeoffs

      • On-device processing limits model flexibility but eliminates latency and privacy risks
      • Supporting many document types requires continuous updates as formats evolve
      • User guidance during capture is more effective than post-processing low-quality images

      What I Learned

      • Edge ML requires careful optimization to meet performance constraints
      • Real-world data is significantly messier than sample datasets
      • Immediate visual feedback during capture dramatically improves accuracy