QuentinFuxa / WhisperLiveKit
Real-time & local speech-to-text, translation, and speaker diarization. With server & web UI.
See what the GitHub community is most excited about this week.
Real-time & local speech-to-text, translation, and speaker diarization. With server & web UI.
E-mails, subdomains and names Harvester - OSINT
MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
Open Source Alternative to NotebookLM / Perplexity, connected to external sources such as Search Engines, Slack, Linear, Jira, ClickUp, Confluence, Notion, YouTube, GitHub, Discord and more. Join our discord: https://discord.gg/ejRNvftDp9
Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, including supervised learning, market dynamics modeling, and RL, and is now equipped with https://github.com/microsoft/RD-Agent to automate R&D process.
"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"
A Model Context Protocol (MCP) Gateway & Registry. Serves as a central management point for tools, resources, and prompts that can be accessed by MCP-compatible LLM applications. Converts REST API endpoints to MCP, composes virtual MCP servers with added security and observability, and converts between protocols (stdio, SSE, Streamable HTTP).
📚 Freely available programming books
Mobile-Agent: The Powerful GUI Agent Family
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
All Algorithms implemented in Python
Generate audiobooks from e-books
SoTA open-source TTS
🎯 告别信息过载,只看真正关心的新闻 - 多平台热点聚合工具,一键监控今日头条、百度热搜、微博、抖音、知乎、B站等35个平台,智能关键词筛选,自动生成热点分析报告。支持企业微信、飞书、钉钉、Telegram推送,30秒网页部署,1分钟手机通知,�?需编程基础。也支持docker私人部署⭐ 让算法为�?服务,而非被算法绑架
A list of useful payloads and bypass for Web Application Security and Pentest/CTF
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
the only cheat sheet you need
A collection of projects showcasing RAG, agents, workflows, and other AI use cases