sample_media_properties.md#
Title#
The title of the work. If it is blank, the attribution sentence uses “This work” instead (for example, “This work by creator is licensed with CC BY”).
Shape of the data and Selection criteria
We select the default title returned by the provider. It can be blank. Blank values (whether None or the empty string “”) are saved as an empty string in the database (TODO: check if this is true).
Existing data problems
Some media items have incorrectly encoded titles [^1 - Link to a description of Unicode encoding problem in the “postamble”]. The frontend compensates for this (link to the code that fixes title encoding). The problem has been fixed for items that were reingested after some time in 2020, but it might still persist for items that have not been updated since then. Link to issues for fixing the encoding in the catalog/api/frontend.
Some Wikimedia titles have the shape “FILE:xxx.svg”. The provider script now removes the “FILE” prefix and the file extension, but this is still a problem for items that were ingested earlier. For those items, the “FILE” prefix and the “.extension” suffix are removed in the frontend (link to the code). Link to the issue to fix it in the API.
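The title rules described in this section can be sketched roughly as follows. This is an illustrative sketch only, not the actual catalog or frontend code: the function names, the regular expressions, and the assumption that the extension is a short alphanumeric suffix are all hypothetical.

```python
import re

# Assumed patterns (not taken from the Openverse codebase): a leading
# "File:" prefix in any letter case, and a trailing 2-4 character extension.
_FILE_PREFIX = re.compile(r"^file:", re.IGNORECASE)
_EXTENSION = re.compile(r"\.[A-Za-z0-9]{2,4}$")


def normalize_title(raw):
    """Map both None and "" to the empty string, as described for storage."""
    return raw or ""


def clean_wikimedia_title(title):
    """Strip the "FILE:" prefix and the extension from titles like "FILE:xxx.svg"."""
    if not _FILE_PREFIX.match(title):
        # Titles without the prefix are left untouched.
        return title
    without_prefix = _FILE_PREFIX.sub("", title, count=1)
    return _EXTENSION.sub("", without_prefix).strip()


def attribution(title, creator, license_name):
    """Build the attribution sentence, falling back to "This work" for blank titles."""
    shown_title = title if title else "This work"
    return f"{shown_title} by {creator} is licensed with {license_name}"
```

For example, `clean_wikimedia_title("FILE:xxx.svg")` yields `"xxx"`, and `attribution(normalize_title(None), "creator", "CC BY")` yields the fallback sentence quoted at the top of this section.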
identifier#
Used for image and audio.
The Openverse identifier is generated during the ingestion process, when the image is inserted into the image table for the first time.
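The "generated on first insert" behavior can be illustrated with a small in-memory sketch. It rests on several assumptions that this page does not state: that the identifier is a version 4 UUID, that a record is recognized as already ingested by a (provider, foreign identifier) pair, and that re-ingestion updates the row but keeps its identifier. The dictionary standing in for the image table and the function name are hypothetical.

```python
import uuid

# Hypothetical stand-in for the image table, keyed by an assumed natural key
# of (provider, foreign_identifier).
image_table = {}


def upsert_image(provider, foreign_identifier, title):
    """Insert or update a row and return its identifier."""
    key = (provider, foreign_identifier)
    row = image_table.get(key)
    if row is None:
        # First insertion: generate the identifier (assumed here to be a UUID v4).
        row = {"identifier": str(uuid.uuid4()), "title": title}
        image_table[key] = row
    else:
        # Re-ingestion: update the data but keep the existing identifier.
        row["title"] = title
    return row["identifier"]
```

Under these assumptions, calling `upsert_image` twice with the same provider and foreign identifier returns the same identifier both times, while the stored title reflects the most recent call.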