JoshCyril | Portfolio

The project was undertaken to address the limitations of traditional bookmark management systems, such as the inability to handle multimedia content and lack of semantic search capabilities.
The goal was to develop an advanced web-based bookmark search engine that supports multi-modal and contextually relevant searches.

Develop a system enabling semantic and visual bookmark retrieval using text, image, and code searches.
Implement role-based access control with separate admin and member roles.
Create a robust backend supporting vector database integration for high-performance search capabilities.
Provide a user-friendly frontend for bookmark management and multi-modal search execution.

Integrating ChromaDB and Weaviate with the Laravel backend for seamless vector search functionality.
Processing diverse content types, including text, images, and code, to generate embeddings for semantic and visual searches.
Designing an interactive user interface for displaying search results and their relevance.
Ensuring efficient handling of large datasets while maintaining near real-time search speeds.

Started with requirement analysis and database schema design.
Implemented the Laravel backend using MVC architecture for clear separation of concerns.
Integrated vector databases for semantic searches and developed API endpoints for operations like web scraping, embedding generation, and search execution.
Built the frontend with TailwindCSS and FilamentPHP for a responsive and interactive user experience.
Conducted iterative testing to optimize performance and ensure feature completeness.

Frontend: TailwindCSS, FilamentPHP, AlpineJS.
Backend: Laravel PHP with MVC architecture.
Vector Databases: ChromaDB for text searches and Weaviate for image searches.
Tools: Docker, RESTful APIs for backend operations.
Database: Relational database with tables for users, collections, and URLs.

Developed APIs for web scraping, metadata extraction, and embedding generation.
Integrated vector databases for context-aware searches using vector distance calculations.
Designed a responsive UI enabling users to create and manage collections and perform multi-modal searches.
Visualized search results using t-SNE for intuitive representation of relationships among stored URLs.

Delivered significant improvements in search accuracy and speed with near real-time results, even for large datasets.
Enhanced user experience through intuitive role-based access and interactive visualizations.
Successfully showcased the broader applicability of vector databases in advanced information retrieval.

Gained insights into integrating vector databases with traditional backends like Laravel.
Highlighted the importance of modular architecture for handling diverse content types efficiently.
Recognized the potential of multi-modal search capabilities in improving user interaction with bookmark tools.

Expand the system’s multi-modal capabilities by refining embeddings and enhancing model training.
Add support for handling larger datasets with optimized data processing pipelines.
Introduce additional features such as collaborative collections and personalized recommendations.

Title	Description	Link
Code	GitHub Code for main site	https://github.com/JoshCyril/Bookmark-SearchEngine
API Code	GitHub Link for API code	https://github.com/JoshCyril/Bookmark-se-backend
weaviate	weaviate website - Image search	https://weaviate.io/developers/weaviate
Chromadb	Chromadb vector database and text, code search	https://docs.trychroma.com/

Bookmark-se