im2txt is a ML model that can take in images, and generate captions describing the scene.
The performance published in the original paper shows results comparable to human performance in certain accuracy/quality metrics. However, due to the relatively small dataset and types of examples available in the training (learning) dataset, the performance varies in the real world.
The code for the model/training is licensed under Apache License 2.0 and the trained model weights is licensed under MIT.