Quantcast
Channel: Machine Learning
Viewing all articles
Browse latest Browse all 62673

[AskML] Finding tv show name and season/episode number in unpredictable filenames -- is ML right for me?

$
0
0

Hello /r/ML! This is my first post here, so please be gentle!

I'm writing a python script that cleans up file names of TV shows based on information scraped from thetvdb.com.

In the overwhelming majority of cases, the name of the show, along with the season/episode information is right in the filename. When this is the case, I'd like for my scraper to be able to pick it out of the other crap in the filename so that I can use it to look up information on thetvdb.com.

So again, in a nutshell, I'd like to :

  • Pick out tv show name, season and episode information from an unpredictable string of characters (filename)

  • Detect a state in which no tv and/or season/episode information is present in the filename.

Does this sound like something a naive baysean classifer could do? I ask because this is the only type of classifier I think I'd be able to implement straight away. If not, could you please point me in the right direction?

As always, tutorials and code snippets are quite welcome!

Thanks in advance!

submitted by omginternets
[link] [8 comments]

Viewing all articles
Browse latest Browse all 62673

Trending Articles