Summary
This is the issue to start gathering all the information needed to migrate the weblectures and talks content from CDS to CDS Videos.
Why is it important?
As part of the CDS migration to https://repository.cern, we need to move content from the legacy system to the new one. Part of the archived video content was already migrated to https://videos.cern.ch some years ago. We will now focus on the remaining video content, which is the Lectures and Events collection. This also matters so that users can find all archived video content on one platform (CDS Videos) and avoid confusion between the two systems.
External dependencies
- The Academic Training website harvests information from CDS for its content.
- Contact the people responsible for the website to check whether they can harvest from CDS Videos after the upcoming migration.
- RCS-SIS are responsible for the website.
- Their harvest code should only need to change to use JSON and map the fields to the ones on CDS Videos.
- Important: preserve embedding links (e.g. https://cds.cern.ch/video/2792031?showTitle=true). We need to implement redirection from legacy embed URLs to CDS Videos embed URLs. At the moment we only have redirection from legacy records to video records.
- Contact https://scivideos.org/source-repository/CERN-CDS and find out how they harvest CDS talks.
- Would OAI harvesting or a simple search on CDS Videos be enough?
- They will be happy to just use the REST API for search.
- Once we migrate, we need to ensure that they are informed and capable of harvesting the content from CDS Videos.
- CAS project requirements
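
The embed redirection mentioned above could look roughly like the sketch below. It is only an illustration under stated assumptions: the lookup table `LEGACY_TO_VIDEOS`, the target id `example-videos-id`, the embed base path on CDS Videos, and the function name `redirect_legacy_embed` are all hypothetical and not taken from the existing code base. The idea is to map the legacy record id to the new id and carry query parameters such as `showTitle` over unchanged.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit

# Hypothetical lookup table: legacy CDS record id -> CDS Videos id.
LEGACY_TO_VIDEOS = {"2792031": "example-videos-id"}

# Assumed embed base URL on CDS Videos; the real path may differ.
VIDEOS_EMBED_BASE = "https://videos.cern.ch/embed"


def redirect_legacy_embed(legacy_url, mapping=LEGACY_TO_VIDEOS):
    """Return the CDS Videos embed URL for a legacy embed URL, or None."""
    parts = urlsplit(legacy_url)
    segments = [s for s in parts.path.split("/") if s]
    # Only handle legacy embed paths of the form /video/<recid>.
    if len(segments) != 2 or segments[0] != "video":
        return None
    target_id = mapping.get(segments[1])
    if target_id is None:
        return None
    # Carry over query parameters such as showTitle unchanged.
    query = urlencode(parse_qsl(parts.query, keep_blank_values=True))
    target = f"{VIDEOS_EMBED_BASE}/{target_id}"
    return f"{target}?{query}" if query else target
```

Returning `None` for unknown ids or non-embed paths leaves it to the caller to decide between a 404 and a fallback to the existing record-level redirection.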
Weblectures publishing workflow
At the moment the Weblecture team creates the weblecture record metadata on CDS, while the video itself is not stored on our system. An example record can be found here.
- Contact the weblecture team to agree on the changes needed to the publishing workflow.
- CDS Videos as the archival repository should ideally be able to download the content, weblecture main file and subformats, and archive it in-house.
- A potential solution would be to get notified when a new weblecture is created/updated and retrieve the manifest (example), so that we can download the files and create the CDS Videos record.
- That would require the CDS Videos platform to be able to download content via an API key from the weblecture storage.
- We need to provide a JSON template to the weblecture team so it can be used to collect the metadata for the corresponding weblecture record.
- We should not mint DOIs for the weblecture videos.
- It may be a good idea to make DOIs optional, e.g. letting users request them on demand.
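
To make the notification-based workflow above more concrete, here is a minimal sketch of the step that turns a retrieved manifest into a draft record plus a list of authenticated downloads. The manifest layout (`title`, `master`, `subformats`, each file with a `url`), the `Authorization: Bearer` header, the record fields, and the function name `plan_weblecture_ingest` are all assumptions made for illustration; the real manifest and the CDS Videos data model may look different. No network calls are made here.

```python
def plan_weblecture_ingest(manifest, api_key, mint_doi=False):
    """Build a draft record and download plan from a weblecture manifest.

    All field names are hypothetical placeholders.
    """
    # Master file first, then any subformats listed in the manifest.
    files = [manifest["master"]] + list(manifest.get("subformats", []))
    # Each download carries the API key for the weblecture storage.
    downloads = [
        {"url": f["url"], "headers": {"Authorization": f"Bearer {api_key}"}}
        for f in files
    ]
    record = {
        "title": manifest["title"],
        "type": "weblecture",
        "files": [f["url"] for f in files],
    }
    # DOIs are opt-in: nothing is requested unless explicitly asked for.
    if mint_doi:
        record["doi_requested"] = True
    return record, downloads
```

Keeping the DOI behind an explicit flag reflects the note above that weblecture videos should not get a DOI by default.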
Weblectures collections
Currently on CDS all relevant content that needs to be migrated is included in the Lectures and Events collection.
- How should we migrate the collection structure?
- On CDS Videos we have the "Channels" on the front page. Investigate whether they can be used to collect and display the weblecture content in a similar fashion so people can browse the content.
Weblecture publishing workflow information
- Here are the notes after our first meeting with the recording service team: https://codimd.web.cern.ch/IsDVkP2OQRmGFavFDZW16A
- File size quotas described in https://codimd.web.cern.ch/QpOTsuovS16CzwtKBu-ndQ
- Total size (master + subformats): 118 TB
Tasks
Design
- Analyse migration (fields) from CDS Marc21 to CDS Videos data model #1955
- Define how the automatic recording system can publish on CDS Videos #1956
- Technical design of the WebLecture video type #1957
- Prototype WebLecture video type #1958
- Design how to migrate the collections structure #1959
Development
- Make DOI optional (for Weblecture video type and/or all video types) #1960
- Implement the integration with recording system #1961
- Implement the collections and search #1962