I’m trying to index news feeds of podcasts, i. e. downloadable files. But I encounter the following problems:
When I import the podcast feeds (e. g. https://www.deutschlandfunkkultur.de/podcast-literatur.1972.de.podcast.xml), they are indexed correctly. But when I run a search, the link of the search result does not point to the audio file. It would be really nice if the download link would be presented in the search results. Can I write an extension which does this? Or could this feature be integrated in the current source code?
When I use an advanced crawl job which is started on a page with links to several podcast feeds, the feeds are not recognized as such and thus indexed like a web page. Shouldn’t the crawler recognize the feeds as such and index them accordingly? Example:
start url = https://www.deutschlandfunkkultur.de/podcasts.2502.de.html?drpp%3Ahash=displayAllBroadcasts