Vision

Vision is a multilingual, multimodal conversational AI system designed to assist visually impaired individuals. It simplifies daily tasks by managing to-do items, answering complex visual queries, and providing live object detection and scene understanding. Users interact with Vision by double-tapping anywhere on the screen and speaking in natural language.

Demo and Documentation

Features

Conversational AI Interaction: Double-tap anywhere on the screen and talk to Vision in natural language.
On-Device Live Object Detection: Speak a command, e.g. "Help me find my laptop", and Vision will automatically activate the camera to locate the object.
Answering Complex Visual Queries and Interpretation: Ask complex queries like "Tell me the expiry date of the medicine here, and also describe the activity of any person present." Vision will capture the image, process the scene, and provide detailed information.
Adaptive Database Services: Interact with the database in natural language. For example, say "Add a task to buy groceries on Friday with high priority," and Vision will manage the task for you, simplifying daily planning.

Submodules

VisionAPI: This is the backend of the Vision system, responsible for processing and responding to user queries.
VisionAndroid: This is the client-side application that users interact with. It communicates with the VisionAPI to provide responses to the user.

Technology Stack

TensorFlow Lite
FastAPI (Python) for REST API
PyTorch
LangChain
Docker
Transformers
Image processing
LLM (large language model)
Prompt engineering techniques

Getting Started

To get a local copy up and running, follow these simple steps:

Visit the main page of the repository.
Navigate to each submodule (VisionAPI and VisionAndroid).
Switch to their main branches.
Clone each submodule individually to your local machine.
After cloning, navigate to the project directory of each submodule.
Install the required packages as mentioned in the respective README files of the submodules.
Run each application as per the instructions provided in their respective README files.

Please ensure you have the necessary environment and tools installed on your machine to run the applications.
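
As an alternative to cloning each submodule by hand, the steps above can be approximated from the parent repository in one pass. The sketch below is not part of the project: `clone_commands` and `run` are hypothetical helper names, the destination folder `Vision` is arbitrary, and it assumes git is installed and that each submodule's primary branch is literally named `main`. It only prints the commands unless `dry_run=False` is passed.

```python
import shlex
import subprocess

# Project link taken from the Contact section of this README.
REPO_URL = "https://github.com/DevOpRohan/Vision"


def clone_commands(repo_url=REPO_URL, dest="Vision", branch="main"):
    """Return the git commands that fetch the parent repo and its submodules.

    Hypothetical helper: `dest` and `branch` are assumptions, not project settings.
    """
    return [
        # Clone the parent repository and initialise its submodules in one go.
        ["git", "clone", "--recurse-submodules", repo_url, dest],
        # Submodules start on a pinned commit; switch each one to the given branch.
        ["git", "-C", dest, "submodule", "foreach", f"git checkout {branch}"],
    ]


def run(commands, dry_run=True):
    """Print each command, and execute it unless dry_run is set."""
    for cmd in commands:
        print(shlex.join(cmd))
        if not dry_run:
            # check=True raises CalledProcessError if git exits with an error.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    run(clone_commands())  # pass dry_run=False to actually clone
```

After the clone finishes, the setup and run instructions in each submodule's own README still apply.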

Contact

Email: rohanvermadev@gmail.com

Project Link: https://github.com/DevOpRohan/Vision
