We’ve reached the midway point of the MeMAD project, and in that time we have developed plenty of exciting state-of-the-art and competition-winning stuff to show off. If you haven’t already, we encourage you to check out the MeMAD GitHub repository!
Some of the cool stuff you can find there includes:
- Audio tagging
- Deep captioning
- Face detection
- Neural machine translation
- Automatic speech recognition
- Speech segmentation
- Tools for image and video analysis, indexing, and description