An innovative attendance system that uses voice recognition to mark attendance efficiently and securely. This project leverages audio processing and speech recognition to streamline the attendance process in classrooms, workplaces, or events.
- Speech Recognition: Converts spoken input into text to mark attendance.
- Voice Authentication: Validates identity using voice biometrics (optional).
- Real-Time Processing: Instantly logs attendance.
- Secure Data Storage: Stores attendance data in a database with encryption.
- Accessible: Suitable for visually impaired individuals or hands-free environments.
- Programming Language: Python
- Audio Processing: SpeechRecognition library, pyannote.audio
- Database: Supabase (PostgreSQL backend)
- Frameworks: Flask or Streamlit (for UI)
- Cloud Services (optional): Google Speech-to-Text, Azure Speech Services
- Hardware: Microphone or audio input device
Below is a GitHub-friendly textual representation of the workflow:
- Professor's Laptop: Audio is recorded in real-time.
- Audio Submission: The professor submits the audio to a secure database (encrypted).
- Students' Laptops: Students retrieve the professor's audio after submission.
- Audio Comparison: Comparison is performed locally on students' laptops between the live room audio and the submitted professor's audio.
- Match Found: Attendance is marked.
- No Match Found: Attendance is denied.
- Dashboard Update: Attendance records are updated for both students and professors.
+-------------------+
| Professor Laptop |
| (Audio Recording) |
+-------------------+
|
v
+-------------------+
| Submit Audio to |
| Database (Encrypt)|
+-------------------+
|
v
+-------------------+
| Students Retrieve |
| Audio for Local |
| Comparison |
+-------------------+
|
v
+-------------------+
| Compare Live Room |
| Audio with |
| Submitted Audio |
+-------------------+
/ \
v v
+-------+ +--------+
| Match | | No |
| Found | | Match |
| Mark | | Deny |
| Attend| | Attend |
+-------+ +--------+
|
v
+-------------------+
| Dashboard Update |
+-------------------+
- Professor Audio Submission: Professors record live audio on their laptops and submit it to the database, where it is stored in encrypted form.
- Student Audio Comparison: Students retrieve the professor's submitted audio and compare it with live audio recorded on their local devices.
- Attendance Marking: Attendance is marked based on the audio comparison result.
- Proxy Mitigation:
- VPN proxies are detected and mitigated.
- Sharing pre-recorded audio is ineffective as live audio differences are detectable.
- The system ensures privacy without using ML/DL models or biometric data.
- The login process for student is different here. It checks if the user is logging in from the regular device or not.
- if not, then student should go through the forgot/generate process where the request will be sent to admin to approve the login from new device and then only new device will be registered and old device ID will be deleted.
- This process ensures that user can do secure login from one device.
- VPN proxy will mitigated.
- sharing the recorded audio will be mitigated as the application can find the difference in live audio and recorded audio.
- It is much more efficient than OTP-based attendance.
- NOT using ML/DL models just student biometric information privacy.
- Noise-cancellation for improved accuracy.
- Analytics dashboard for attendance trends.