Summary
A report on models with multimodal input and initial evaluations of their quality.
More information & hyperlinks