What is the current state of reproducible science in the field of machine...
Writing my bachelor's thesis, I noticed that I had several problems with my workflow:Keeping a journal and analyze one's test results is difficult. Doing so and in the meantime running several other...
View ArticleNew open-source Machine Learning Framework written in Java
submitted by datumbox [link][7 comments]
View ArticleSelf-study Linear Algebra textbook for ML and Stats
I am looking for a good linear/matrix algebra textbook, suitable for self-study, that covers topics relevant to statistics and machine learning. I have access to Gentle's "Matrix Algebra", but have...
View ArticleHow does Aritificial Neural Network work - simplified
submitted by kunalj101 [link][1 comment]
View ArticlePandas 0.15.0 supports categorical values in data frames.. will this now work...
Pandas update - http://pandas.pydata.org/pandas-docs/stable/whatsnew.htmlI had tried to run some previous work I'd done in R through Python using Pandas and sklearn (in order to learn more about...
View ArticlePeople in research groups, what is your setup for shared server/resources?
So my university finally decided to shell out some money to meet their expectations of our research scaling up to, *gulp*, "big data". I am not sure as to what we are going to get (they do not want to...
View ArticleTraining Random Forest; Data Too Big For Memory
How do people deal with fitting random forests when data is too big for memory? Currently I use sklearn in python and sample every N rows s.t. the data fits in memory. submitted by ml_man [link][8...
View ArticleHow to classify text with scikit learn?
I have extracted some bigram features(Adjective/Adverb, Noun/Adjective, Adjective/Verb, Noun/Verb) from a POS tagged corpus with the help of it´s POS tags. The bigrams look as follows: [('word',...
View ArticleProcess for finding which machine learning algorithms/package to use?
I'd like to hear your processes for figuring out what machine learning algorithms/packages to use when tackling a machine learning problem. What are the factors you consider? Considerations about the...
View ArticleWhats your impression of our NLP Content Analysis & Summarization platform?
We built a temporary portal for testing of our upcoming API & platform. So far we are quiet pleased with the results, but could use additional feedback from Reddit friends.This will be...
View ArticleDefending Microfinance with Data Science
submitted by andrewinshorts [link][2 comments]
View ArticleOpen problems in Machine Learning
What do you consider to be some of the major open problems in machine learning and its associated fields? Both practical and theoretical problems are welcome, but for the sake of conciseness leave out...
View ArticleUC Berkeley's Michael Jordan on the epidemic of bad ideas about machine...
submitted by boboleo [link][comment]
View ArticleQuestion regarding model selection for linear discriminant analysis models
Hi,I'm working on a classification project for a data mining class, and we have to test several classification models on a dataset. (250,000 observations, 2 groups, 30 independent variables)I'm working...
View ArticleTerence Tao's philosophical take on strong AI
I thought Terence's answer to this question to be thought provoking, so wanted to hear others' opinions.Q: What is your opinion on strong AI?A: The funny thing about AI is that it’s a moving target. In...
View ArticleUse historic data as training data for late/on time payments? (python, scikit)
I have some flex time where I can pursue my interests at work as long as I can apply it there. Recently I obtained ~100,000 invoice records over two years. Can I use this as training data for a...
View ArticleAnalyzing the omics of the brain: upcoming NIPS workshop bringing together...
submitted by urish [link][comment]
View Article