If proprietary/SaaS fits your use case, I can recommend Speechmatics. It has a wider range of languages than a lot of the competition: https://speechmatics.com
Will it work with, say, someone speaking English with some Hindi mixed in? I'm not from India, so I'm not sure how prevalent that is, but I've been told it's quite common to "mix it up" there, and I probably need to cater for that use case.
PS: if you can share your email address, I'll pop you a note about Speechmatics. I tried the English version and it's impressive.
This is definitely the sort of use case we aim to support! I would need to check about Hindi specifically, but we have several bilingual models already with more to come:
Hi, I'm an engineer at Speechmatics. Our speech-to-text software handles speaker diarization very reliably, and we're a go-to choice for non-English languages. https://www.speechmatics.com/
I made a web app which gives you a breakdown of an image's color distribution. It lets you pick color schemes from a subset of the image's colors (specifically, those that form the convex hull when the colors are plotted in RGB color space).
(Note: There are still some issues with showing images on iOS browsers)
I mainly made this to satisfy my own curiosity. I had the idea of forming a convex polytope based on an image's colors, and I thought it would be cool to be able to explore that in a responsive way through a graphical UI. Also, it was a great way to learn Svelte and ThreeJS, which in many ways work rather well together! If anyone is curious to see the source code, I'm happy to link it too.
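For anyone curious about the convex hull step, here is a rough sketch of the idea in plain Python. This is not the app's actual source (that is Svelte and ThreeJS); the name `hull_colors` and the brute-force approach are my own assumptions for illustration. It treats each color as a point in RGB space and keeps the colors that lie on a face of the hull, which includes the hull's corner colors. It assumes the image has at least four colors that are not all in one plane, and it is far too slow for raw pixel data, so you would want to quantize the image down to a small palette first.

```python
from itertools import combinations

Color = tuple[int, int, int]  # (r, g, b), each channel 0..255


def _sub(a: Color, b: Color) -> Color:
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])


def _cross(u: Color, v: Color) -> Color:
    return (
        u[1] * v[2] - u[2] * v[1],
        u[2] * v[0] - u[0] * v[2],
        u[0] * v[1] - u[1] * v[0],
    )


def _dot(u: Color, v: Color) -> int:
    return u[0] * v[0] + u[1] * v[1] + u[2] * v[2]


def hull_colors(colors: list[Color]) -> list[Color]:
    """Return the colors lying on the boundary of the RGB convex hull.

    Brute force: a triple of colors spans a hull face exactly when
    every other color lies on one side of the plane through them.
    """
    pts = sorted(set(colors))
    on_hull: set[Color] = set()
    for a, b, c in combinations(pts, 3):
        normal = _cross(_sub(b, a), _sub(c, a))
        if normal == (0, 0, 0):
            continue  # collinear triple, no plane to test
        sides = [_dot(normal, _sub(p, a)) for p in pts]
        if all(s >= 0 for s in sides) or all(s <= 0 for s in sides):
            on_hull.update((a, b, c))
    return sorted(on_hull)


# Example: the eight corners of the RGB cube plus a mid-gray.
pixels = [(r, g, b) for r in (0, 255) for g in (0, 255) for b in (0, 255)]
pixels.append((128, 128, 128))  # sits strictly inside the cube
palette = hull_colors(pixels)   # the gray is dropped, the corners remain
```

Everything is integer arithmetic, so there are no floating-point tolerance issues. For anything beyond a few dozen colors, a proper hull algorithm such as Quickhull would be the better choice.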
(Full disclosure: I'm an engineer there.)