g***@apache.org
2018-06-26 19:11:13 UTC
Repository: mahout
Updated Branches:
refs/heads/asf-site 5829ad890 -> 7aee324d6
Automatic Site Publish by Buildbot
Project: http://git-wip-us.apache.org/repos/asf/mahout/repo
Commit: http://git-wip-us.apache.org/repos/asf/mahout/commit/7aee324d
Tree: http://git-wip-us.apache.org/repos/asf/mahout/tree/7aee324d
Diff: http://git-wip-us.apache.org/repos/asf/mahout/diff/7aee324d
Branch: refs/heads/asf-site
Commit: 7aee324d6317129966480ad2956c032277897ddf
Parents: 5829ad8
Author: jenkins <***@apache.org>
Authored: Tue Jun 26 19:11:09 2018 +0000
Committer: jenkins <***@apache.org>
Committed: Tue Jun 26 19:11:09 2018 +0000
----------------------------------------------------------------------
feed.xml | 21 ++-
index.html | 31 ++++
news/2018/06/18/preping-014.html | 201 ++++++++++++++++++++++++
users/algorithms/recommender-overview.html | 14 +-
4 files changed, 263 insertions(+), 4 deletions(-)
----------------------------------------------------------------------
http://git-wip-us.apache.org/repos/asf/mahout/blob/7aee324d/feed.xml
----------------------------------------------------------------------
diff --git a/feed.xml b/feed.xml
index b659dfa..384fef6 100644
--- a/feed.xml
+++ b/feed.xml
@@ -1,4 +1,23 @@
-<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="3.5.0">Jekyll</generator><link href="http://mahout.apache.org//feed.xml" rel="self" type="application/atom+xml" /><link href="http://mahout.apache.org//" rel="alternate" type="text/html" /><updated>2018-06-18T19:47:01+00:00</updated><id>http://mahout.apache.org//</id><title type="html">Apache Mahout</title><subtitle>Distributed Linear Algebra</subtitle><author><name>The Apache Software Foundation</name><email>***@mahout.apache.org</email></author><entry><title type="html">New Website</title><link href="http://mahout.apache.org//news/2017/12/20/new-website.html" rel="alternate" type="text/html" title="New Website" /><published>2017-12-20T22:16:01+00:00</published><updated>2017-12-20T22:16:01+00:00</updated><id>http://mahout.apache.org//news/2017/12/20/new-website</id><content type="html" xml:base="http://mahout.apache.org//news/2017/12/20/new-website.html">
<p>We’ve launced a new website, (a HUUUGE special thanks to David Miller from <a href="http://startbootstrap.com">StartBootstrap.com</a>).</p>
+<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="3.5.0">Jekyll</generator><link href="http://mahout.apache.org//feed.xml" rel="self" type="application/atom+xml" /><link href="http://mahout.apache.org//" rel="alternate" type="text/html" /><updated>2018-06-26T19:10:24+00:00</updated><id>http://mahout.apache.org//</id><title type="html">Apache Mahout</title><subtitle>Distributed Linear Algebra</subtitle><author><name>The Apache Software Foundation</name><email>***@mahout.apache.org</email></author><entry><title type="html">Preparing for version 0.14.0</title><link href="http://mahout.apache.org//news/2018/06/18/preping-014.html" rel="alternate" type="text/html" title="Preparing for version 0.14.0" /><published>2018-06-18T22:16:01+00:00</published><updated>2018-06-18T22:16:01+00:00</updated><id>http://mahout.apache.org//news/2018/06/18/preping-014</id><content type="html" xml:base="http://mahout.apache.org//
news/2018/06/18/preping-014.html"><p>Hey all! We’re preparing for Mahout version 0.14.0.</p>
+
+<p>This release might not seem super exciting from a user perspective (except
+we’re bumping Apache Spark to version 2.x and Scala to version 2.11) but
+it includes some very important ‘behind the scenes changes’. We’re doing a
+heavy refactoring which will combine <code class="highlighter-rouge">math</code> and <code class="highlighter-rouge">math-scala</code> into a module
+called <code class="highlighter-rouge">core</code> (no change to code required!).</p>
+
+<p>We will be moving H2o and Flink-batch engines into <code class="highlighter-rouge">community</code> that is to
+say the code will still be available, but we won’t be actively maintaining
+it (we will accept PRs though). We’re moving the Map Reduce code to community
+and have considered to once again accepting PRs against this code (if you have
+an open PR, we’ll hit you up after the release about rebasing and considering to
+resubmit the PR)</p>
+
+<p>We’re going to be updating the Java dependency from 1.7 to 1.8.</p>
+
+<p>There was a LOT of old junk in the POMs which we have clipped out.</p>
+
+<p>Stay tuned and <a href="https://issues.apache.org/jira/secure/RapidBoard.jspa?rapidView=80&amp;projectKey=MAHOUT&amp;view=detail&amp;selectedIssue=MAHOUT-2047">feel free to help out</a>!</p></content><author><name>The Apache Software Foundation</name><email>***@mahout.apache.org</email></author><category term="news" /><summary type="html">Hey all! We’re preparing for Mahout version 0.14.0.</summary></entry><entry><title type="html">New Website</title><link href="http://mahout.apache.org//news/2017/12/20/new-website.html" rel="alternate" type="text/html" title="New Website" /><published>2017-12-20T22:16:01+00:00</published><updated>2017-12-20T22:16:01+00:00</updated><id>http://mahout.apache.org//news/2017/12/20/new-website</id><content type="html" xml:base="http://mahout.apache.org//news/2017/12/20/new-website.html"><p>We’ve launced a new website, (a HUUUGE special thanks to David Miller from <a href="http://startboot
strap.com">StartBootstrap.com</a>).</p>
<p>It looks great but there’s lots to do as we migrate and clean up content from the old site. If you see anything broken,
please let us know via either:</p>
http://git-wip-us.apache.org/repos/asf/mahout/blob/7aee324d/index.html
----------------------------------------------------------------------
diff --git a/index.html b/index.html
index 4aed681..802cb4f 100644
--- a/index.html
+++ b/index.html
@@ -195,6 +195,37 @@
</ul>
+ <div class="float-md-right small text-muted">June 18, 2018</div>
+ </div>
+ <h3 class="card-title"><a href="/news/2018/06/18/preping-014.html">Preparing for version 0.14.0</a></h3>
+ <p>Hey all! We’re preparing for Mahout version 0.14.0.
+
+</p>
+ <a href="/news/2018/06/18/preping-014.html" class="btn btn-primary btn-sm">Read Post →</a>
+ </div>
+ </div>
+ </div>
+</div>
+
+<div class="card card-post-preview mb-4">
+ <div class="card-body">
+ <div class="row no-gutters">
+ <div class="col-3 d-none d-md-flex my-auto card-post-img">
+
+ <img class="rounded-circle img-fluid d-flex mx-auto" src="/assets/mahout-logo-blue.svg" />
+
+ </div>
+ <div class="col-12 col-md-9 card-post-text">
+ <div class="clearfix mb-3">
+ <ul class="list-inline float-md-left mb-md-0">
+
+
+ <li class="list-inline-item">
+ <a class="small text-uppercase" href="#">news</a>
+ </li>
+
+
+ </ul>
<div class="float-md-right small text-muted">December 20, 2017</div>
</div>
<h3 class="card-title"><a href="/news/2017/12/20/new-website.html">New Website</a></h3>
http://git-wip-us.apache.org/repos/asf/mahout/blob/7aee324d/news/2018/06/18/preping-014.html
----------------------------------------------------------------------
diff --git a/news/2018/06/18/preping-014.html b/news/2018/06/18/preping-014.html
new file mode 100644
index 0000000..d0e4878
--- /dev/null
+++ b/news/2018/06/18/preping-014.html
@@ -0,0 +1,201 @@
+<!DOCTYPE html>
+<html lang=" en ">
+
+<head>
+ <meta charset="utf-8">
+ <meta http-equiv="X-UA-Compatible" content="IE=edge">
+ <meta name="viewport" content="width=device-width, initial-scale=1">
+
+ <title>
+ Preparing for version 0.14.0
+
+ </title>
+
+ <meta name="description" content="Hey all! We’re preparing for Mahout version 0.14.0.">
+
+ <link rel="stylesheet" href="/assets/css/main.css">
+
+ <!-- Font Awesome -->
+ <link href="https://maxcdn.bootstrapcdn.com/font-awesome/4.7.0/css/font-awesome.min.css" rel="stylesheet" integrity="sha384-wvfXpqpZZVQGK6TAh5PVlGOfQNHSoD2xbE+QkPxCAFlNEevoEH3Sl0sibVcOQVnN" crossorigin="anonymous">
+
+ <!-- Google Fonts -->
+ <link href="https://fonts.googleapis.com/css?family=Maven+Pro:400,500" rel="stylesheet">
+ <link href="https://fonts.googleapis.com/css?family=Muli:400,400i,700,700i" rel="stylesheet">
+
+ <link rel="canonical" href="http://mahout.apache.org//news/2018/06/18/preping-014.html">
+ <link rel="alternate" type="application/rss+xml" title="Apache Mahout" href="/%20/feed.xml">
+
+
+</head>
+
+<script>
+ (function(i,s,o,g,r,a,m){i['GoogleAnalyticsObject']=r;i[r]=i[r]||function(){
+ (i[r].q=i[r].q||[]).push(arguments)},i[r].l=1*new Date();a=s.createElement(o),
+ m=s.getElementsByTagName(o)[0];a.async=1;a.src=g;m.parentNode.insertBefore(a,m)
+ })(window,document,'script','//www.google-analytics.com/analytics.js','ga');
+
+ ga('create', 'UA-98314020-1', 'auto');
+ ga('send', 'pageview');
+</script>
+<body>
+
+ <nav class="navbar navbar-expand-lg navbar-light bg-light navbar-mahout">
+
+ <div class="container">
+
+ <a class="navbar-brand" href="/">
+ <img src="/assets/mahout-logo-blue.svg" alt="">
+ </a>
+
+ <button class="navbar-toggler" type="button" data-toggle="collapse" data-target="#navbarSupportedContent" aria-controls="navbarSupportedContent" aria-expanded="false" aria-label="Toggle navigation">
+ <span class="navbar-toggler-icon"></span>
+ </button>
+
+ <div class="collapse navbar-collapse" id="navbarSupportedContent">
+
+ <ul class="navbar-nav ml-auto">
+
+ <!-- Download -->
+ <li class="nav-item">
+ <a class="nav-link" href="/general/downloads">Download</a>
+ </li>
+
+ <li class="nav-item">
+ <a class="nav-link" href="/docs/latest/index.html">Overview</a>
+ </li>
+
+ <!-- Developers -->
+ <li class="nav-item dropdown">
+ <a class="nav-link dropdown-toggle" href="" id="navbarDropdownMenuLink" data-toggle="dropdown" aria-haspopup="true" aria-expanded="false">Developers</a>
+ <div class="dropdown-menu" aria-labelledby="navbarDropdownMenuLink">
+ <a class="dropdown-item" href="/developers/developer-resources.html">Developer Resources</a>
+ <a class="dropdown-item" href="/developers/buildingmahout">Building Mahout from Source</a>
+ <a class="dropdown-item" href="/developers/issue-tracker">Issues Tracking (JIRA)</a>
+ <!-- <a class="dropdown-item" href="/developers/patch-check-list/">Patch Check List</a> going to github template -->
+ <!-- <a class="dropdown-item" href="/developers/reference/">References</a> a lot of overlap with books, talks, etc. page -->
+ <a class="dropdown-item" href="/developers/release-notes/">Release Notes</a>
+ <!-- <a class="dropdown-item" href="/developers/thirdparty-dependencies/">Third Party Dependencies</a> is our site the reasonable place for this? -->
+ <!-- <a class="dropdown-item" href="/developers/version-control/">Version Control</a> -->
+ <div class="dropdown-divider"></div>
+ <h6 class="dropdown-header">How To's</h6>
+ <a class="dropdown-item" href="/developers/how-to-contribute">How to Contribute</a>
+ <a class="dropdown-item" href="/developers/githubPRs">Github PRs</a>
+ <a class="dropdown-item" href="/developers/how-to-become-a-committer">How to Become a Committer</a>
+ <a class="dropdown-item" href="/developers/how-to-release">How to Release</a>
+ <a class="dropdown-item" href="/developers/how-to-update-the-website">How to Update the Website</a>
+ </div>
+ </li>
+
+ <!-- Docs -->
+ <li class="nav-item dropdown">
+ <a class="nav-link dropdown-toggle" href="" id="navbarDropdownMenuLink" data-toggle="dropdown" aria-haspopup="true" aria-expanded="false">Docs</a>
+ <div class="dropdown-menu" aria-labelledby="navbarDropdownMenuLink">
+ <h6 class="dropdown-header">Release</h6>
+ <a class="dropdown-item" href="/docs/0.13.0">0.13.0</a>
+ <div class="dropdown-divider"></div>
+ <h6 class="dropdown-header">Latest Snapshot (Development)</h6>
+ <a class="dropdown-item" href="/docs/latest">0.13.1-SNAPSHOT</a>
+ </div>
+ </li>
+
+ <!-- Community -->
+ <li class="nav-item dropdown">
+ <a class="nav-link dropdown-toggle" href="" id="navbarDropdownMenuLink" data-toggle="dropdown" aria-haspopup="true" aria-expanded="false">Community</a>
+ <div class="dropdown-menu dropdown-menu-right">
+ <!--<a class="dropdown-item" href="/community/history/">History of the Apache Mahout Project</a>-->
+ <!--<a class="dropdown-item" href="/community/blogs/">Blog Posts About Mahout</a>-->
+ <!--<a class="dropdown-item" href="/community/recent-upcoming-talks/">Recent and Upcoming Talks</a>-->
+ <!-- <a class="dropdown-item" href="/community/books-tutorials-and-talks/">Books Tutorials and Talks</a> -->
+ <!-- <a class="dropdown-item" href="/community/faq/">FAQ</a> needs a lot of updating -->
+ <a class="dropdown-item" href="/developers/gsoc">GSoC</a>
+ <!-- Is OK- updated Map/Reduce verbage to reflect Samsara -->
+ <!-- <a class="dropdown-item" href="/community/mahout-benchmarks/">Mahout Benchmarks</a> These are old, Keep them? -->
+ <!-- <a class="dropdown-item" href="/community/mahout-wiki/">Mahout Wiki</a> at very least needs links cleanedup - do we still want this even?-->
+ <a class="dropdown-item" href="/general/mailing-lists">Mailing Lists</a>
+ <!-- Clean and pretty -->
+ <!-- <a class="dropdown-item" href="/community/powered-by-mahout/">Powered By Mahout</a> needs update -->
+ <a class="dropdown-item" href="/general/privacy-policy">Privacy Policy</a>
+ <!-- <a class="dropdown-item" href="/community/professional-support/">Professional Support</a> update if we even want to keep -->
+ <a class="dropdown-item" href="/general/who-we-are">Who We Are</a>
+ <!-- nikolai needs to add himself -->
+ </div>
+ </li>
+
+ <!-- GitHub -->
+ <li class="nav-item">
+ <a class="nav-link" href="http://github.com/apache/mahout"><i class="fa fa-github"></i></a>
+ </li>
+
+ </ul>
+
+ <!-- <form class="form-inline my-2 my-lg-0">
+ <input class="form-control mr-sm-2" type="text" placeholder="Search" aria-label="Search">
+ <button class="btn btn-outline-success my-2 my-sm-0" type="submit">Search</button>
+ </form> -->
+
+ </div>
+
+ </div>
+
+</nav>
+
+
+ <div class="container mt-5 pb-4">
+
+ <article class="post" itemscope="itemscope" itemtype="http://schema.org/BlogPosting">
+
+ <header class="post-header">
+ <h1 class="post-title" itemprop="name headline">Preparing for version 0.14.0</h1>
+ <p class="post-meta">
+ <time datetime="2018-06-18T22:16:01+00:00" itemprop="datePublished">
+ 2018-06-18 22:16:01 +0000
+ </time>
+
+ </p>
+ </header>
+
+ <div class="post-content" itemprop="articleBody">
+ <p>Hey all! We’re preparing for Mahout version 0.14.0.</p>
+
+<p>This release might not seem super exciting from a user perspective (except
+we’re bumping Apache Spark to version 2.x and Scala to version 2.11) but
+it includes some very important ‘behind the scenes changes’. We’re doing a
+heavy refactoring which will combine <code class="highlighter-rouge">math</code> and <code class="highlighter-rouge">math-scala</code> into a module
+called <code class="highlighter-rouge">core</code> (no change to code required!).</p>
+
+<p>We will be moving H2o and Flink-batch engines into <code class="highlighter-rouge">community</code> that is to
+say the code will still be available, but we won’t be actively maintaining
+it (we will accept PRs though). We’re moving the Map Reduce code to community
+and have considered to once again accepting PRs against this code (if you have
+an open PR, we’ll hit you up after the release about rebasing and considering to
+resubmit the PR)</p>
+
+<p>We’re going to be updating the Java dependency from 1.7 to 1.8.</p>
+
+<p>There was a LOT of old junk in the POMs which we have clipped out.</p>
+
+<p>Stay tuned and <a href="https://issues.apache.org/jira/secure/RapidBoard.jspa?rapidView=80&projectKey=MAHOUT&view=detail&selectedIssue=MAHOUT-2047">feel free to help out</a>!</p>
+
+
+ </div>
+
+ </article>
+
+</div>
+
+
+ <footer class="footer bg-light">
+ <div class="container text-center small">
+ Copyright © 2014-2018 The Apache Software Foundation, Licensed under the Apache License, Version 2.0.
+ </div>
+</footer>
+
+ <script src="/assets/vendor/jquery/jquery-slim.min.js"></script>
+ <script src="/assets/vendor/popper/popper.min.js"></script>
+ <script src="/assets/vendor/bootstrap/js/bootstrap.min.js"></script>
+ <script src="/assets/header.js"></script>
+ <script src="https://cdnjs.cloudflare.com/ajax/libs/mathjax/2.7.0/MathJax.js?config=TeX-AMS-MML_HTMLorMML" type="text/javascript"></script>
+
+</body>
+
+</html>
http://git-wip-us.apache.org/repos/asf/mahout/blob/7aee324d/users/algorithms/recommender-overview.html
----------------------------------------------------------------------
diff --git a/users/algorithms/recommender-overview.html b/users/algorithms/recommender-overview.html
index 6cbdb2b..a792859 100644
--- a/users/algorithms/recommender-overview.html
+++ b/users/algorithms/recommender-overview.html
@@ -142,7 +142,15 @@
<h1 id="recommender-overview">Recommender Overview</h1>
-<p>Recommenders have changed over the years. Mahout contains a long list of them, which you can still use. But to get the best out of our more modern aproach we’ll need to think of the Recommender as a “model creation” component—supplied by Mahout’s new spark-itemsimilarity job, and a “serving” component—supplied by a modern scalable search engine, like Solr.</p>
+<p>Recommenders have changed over the years. Mahout contains a long list of them, which you can still use. However in about 2013 there was a revolution in recommenders, which favored what we might call “Multimodal”, meaning they could take in data of all sorts—basically anything we might think was an indicator of user taste. The new Samsara algorithm, called Correlated Cross-Occurrence (CCO) is just such a next gen recommender algorithm but Mahout-Samsara only implements the model building part. This can be integrated as the user see fit and the rest of this doc will explain how.</p>
+
+<h2 id="turnkey-implementation">Turnkey Implementation</h2>
+
+<p>If you are looking for an end-to-end OSS recommender based on the Mahout CCO algorithm have a look at <a href="https://github.com/actionml/universal-recommender">The Universal Recommender</a>, which is implemented using <a href="http://predictionio.apache.org/">Apache PredictionIO</a>. See instructions for <a href="http://actionml.com/docs/pio_by_actionml">installation here</a>. There is even an AWS AMI for convenience (this is a for-pay option)</p>
+
+<h2 id="build-your-own-integration">Build Your Own Integration</h2>
+
+<p>To get the most out of our more modern CCO algorithm we’ll need to think of the Recommender as a “model creation” component—supplied by Mahout’s new spark-itemsimilarity job, and a “serving” component—supplied by a modern scalable search engine, like Solr or Elasticsearch. Here we describe a loose integration that does not require using Mahout as a library, it uses Mahout’s command line interface. This is clearly not the best but allows one to experiments and get a real recommender running easily.</p>
<p><img src="Loading Image...
" alt="image" /></p>
@@ -154,7 +162,7 @@
<p>All ids for users and items are preserved as string tokens and so work as an external key in DBs or as doc ids for search engines, they also work as tokens for search queries.</p>
-<p>##References</p>
+<h2 id="references">References</h2>
<ol>
<li>A free ebook, which talks about the general idea: <a href="https://www.mapr.com/practical-machine-learning">Practical Machine Learning</a></li>
@@ -164,7 +172,7 @@ and <a href="http://occamsmachete.com/ml/2014/09/09/mahout-on-spark-whats-new-i
<li>A post describing the loglikelihood ratio: <a href="http://tdunning.blogspot.com/2008/03/surprise-and-coincidence.html">Surprise and Coinsidense</a> LLR is used to reduce noise in the data while keeping the calculations O(n) complexity.</li>
</ol>
-<p>##Mahout Model Creation</p>
+<h2 id="mahout-model-creation">Mahout Model Creation</h2>
<p>See the page describing <a href="http://mahout.apache.org/users/recommender/intro-cooccurrence-spark.html"><em>spark-itemsimilarity</em></a> for more details.</p>
Updated Branches:
refs/heads/asf-site 5829ad890 -> 7aee324d6
Automatic Site Publish by Buildbot
Project: http://git-wip-us.apache.org/repos/asf/mahout/repo
Commit: http://git-wip-us.apache.org/repos/asf/mahout/commit/7aee324d
Tree: http://git-wip-us.apache.org/repos/asf/mahout/tree/7aee324d
Diff: http://git-wip-us.apache.org/repos/asf/mahout/diff/7aee324d
Branch: refs/heads/asf-site
Commit: 7aee324d6317129966480ad2956c032277897ddf
Parents: 5829ad8
Author: jenkins <***@apache.org>
Authored: Tue Jun 26 19:11:09 2018 +0000
Committer: jenkins <***@apache.org>
Committed: Tue Jun 26 19:11:09 2018 +0000
----------------------------------------------------------------------
feed.xml | 21 ++-
index.html | 31 ++++
news/2018/06/18/preping-014.html | 201 ++++++++++++++++++++++++
users/algorithms/recommender-overview.html | 14 +-
4 files changed, 263 insertions(+), 4 deletions(-)
----------------------------------------------------------------------
http://git-wip-us.apache.org/repos/asf/mahout/blob/7aee324d/feed.xml
----------------------------------------------------------------------
diff --git a/feed.xml b/feed.xml
index b659dfa..384fef6 100644
--- a/feed.xml
+++ b/feed.xml
@@ -1,4 +1,23 @@
-<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="3.5.0">Jekyll</generator><link href="http://mahout.apache.org//feed.xml" rel="self" type="application/atom+xml" /><link href="http://mahout.apache.org//" rel="alternate" type="text/html" /><updated>2018-06-18T19:47:01+00:00</updated><id>http://mahout.apache.org//</id><title type="html">Apache Mahout</title><subtitle>Distributed Linear Algebra</subtitle><author><name>The Apache Software Foundation</name><email>***@mahout.apache.org</email></author><entry><title type="html">New Website</title><link href="http://mahout.apache.org//news/2017/12/20/new-website.html" rel="alternate" type="text/html" title="New Website" /><published>2017-12-20T22:16:01+00:00</published><updated>2017-12-20T22:16:01+00:00</updated><id>http://mahout.apache.org//news/2017/12/20/new-website</id><content type="html" xml:base="http://mahout.apache.org//news/2017/12/20/new-website.html">
<p>We’ve launced a new website, (a HUUUGE special thanks to David Miller from <a href="http://startbootstrap.com">StartBootstrap.com</a>).</p>
+<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="3.5.0">Jekyll</generator><link href="http://mahout.apache.org//feed.xml" rel="self" type="application/atom+xml" /><link href="http://mahout.apache.org//" rel="alternate" type="text/html" /><updated>2018-06-26T19:10:24+00:00</updated><id>http://mahout.apache.org//</id><title type="html">Apache Mahout</title><subtitle>Distributed Linear Algebra</subtitle><author><name>The Apache Software Foundation</name><email>***@mahout.apache.org</email></author><entry><title type="html">Preparing for version 0.14.0</title><link href="http://mahout.apache.org//news/2018/06/18/preping-014.html" rel="alternate" type="text/html" title="Preparing for version 0.14.0" /><published>2018-06-18T22:16:01+00:00</published><updated>2018-06-18T22:16:01+00:00</updated><id>http://mahout.apache.org//news/2018/06/18/preping-014</id><content type="html" xml:base="http://mahout.apache.org//
news/2018/06/18/preping-014.html"><p>Hey all! We’re preparing for Mahout version 0.14.0.</p>
+
+<p>This release might not seem super exciting from a user perspective (except
+we’re bumping Apache Spark to version 2.x and Scala to version 2.11) but
+it includes some very important ‘behind the scenes changes’. We’re doing a
+heavy refactoring which will combine <code class="highlighter-rouge">math</code> and <code class="highlighter-rouge">math-scala</code> into a module
+called <code class="highlighter-rouge">core</code> (no change to code required!).</p>
+
+<p>We will be moving H2o and Flink-batch engines into <code class="highlighter-rouge">community</code> that is to
+say the code will still be available, but we won’t be actively maintaining
+it (we will accept PRs though). We’re moving the Map Reduce code to community
+and have considered to once again accepting PRs against this code (if you have
+an open PR, we’ll hit you up after the release about rebasing and considering to
+resubmit the PR)</p>
+
+<p>We’re going to be updating the Java dependency from 1.7 to 1.8.</p>
+
+<p>There was a LOT of old junk in the POMs which we have clipped out.</p>
+
+<p>Stay tuned and <a href="https://issues.apache.org/jira/secure/RapidBoard.jspa?rapidView=80&amp;projectKey=MAHOUT&amp;view=detail&amp;selectedIssue=MAHOUT-2047">feel free to help out</a>!</p></content><author><name>The Apache Software Foundation</name><email>***@mahout.apache.org</email></author><category term="news" /><summary type="html">Hey all! We’re preparing for Mahout version 0.14.0.</summary></entry><entry><title type="html">New Website</title><link href="http://mahout.apache.org//news/2017/12/20/new-website.html" rel="alternate" type="text/html" title="New Website" /><published>2017-12-20T22:16:01+00:00</published><updated>2017-12-20T22:16:01+00:00</updated><id>http://mahout.apache.org//news/2017/12/20/new-website</id><content type="html" xml:base="http://mahout.apache.org//news/2017/12/20/new-website.html"><p>We’ve launced a new website, (a HUUUGE special thanks to David Miller from <a href="http://startboot
strap.com">StartBootstrap.com</a>).</p>
<p>It looks great but there’s lots to do as we migrate and clean up content from the old site. If you see anything broken,
please let us know via either:</p>
http://git-wip-us.apache.org/repos/asf/mahout/blob/7aee324d/index.html
----------------------------------------------------------------------
diff --git a/index.html b/index.html
index 4aed681..802cb4f 100644
--- a/index.html
+++ b/index.html
@@ -195,6 +195,37 @@
</ul>
+ <div class="float-md-right small text-muted">June 18, 2018</div>
+ </div>
+ <h3 class="card-title"><a href="/news/2018/06/18/preping-014.html">Preparing for version 0.14.0</a></h3>
+ <p>Hey all! We’re preparing for Mahout version 0.14.0.
+
+</p>
+ <a href="/news/2018/06/18/preping-014.html" class="btn btn-primary btn-sm">Read Post →</a>
+ </div>
+ </div>
+ </div>
+</div>
+
+<div class="card card-post-preview mb-4">
+ <div class="card-body">
+ <div class="row no-gutters">
+ <div class="col-3 d-none d-md-flex my-auto card-post-img">
+
+ <img class="rounded-circle img-fluid d-flex mx-auto" src="/assets/mahout-logo-blue.svg" />
+
+ </div>
+ <div class="col-12 col-md-9 card-post-text">
+ <div class="clearfix mb-3">
+ <ul class="list-inline float-md-left mb-md-0">
+
+
+ <li class="list-inline-item">
+ <a class="small text-uppercase" href="#">news</a>
+ </li>
+
+
+ </ul>
<div class="float-md-right small text-muted">December 20, 2017</div>
</div>
<h3 class="card-title"><a href="/news/2017/12/20/new-website.html">New Website</a></h3>
http://git-wip-us.apache.org/repos/asf/mahout/blob/7aee324d/news/2018/06/18/preping-014.html
----------------------------------------------------------------------
diff --git a/news/2018/06/18/preping-014.html b/news/2018/06/18/preping-014.html
new file mode 100644
index 0000000..d0e4878
--- /dev/null
+++ b/news/2018/06/18/preping-014.html
@@ -0,0 +1,201 @@
+<!DOCTYPE html>
+<html lang=" en ">
+
+<head>
+ <meta charset="utf-8">
+ <meta http-equiv="X-UA-Compatible" content="IE=edge">
+ <meta name="viewport" content="width=device-width, initial-scale=1">
+
+ <title>
+ Preparing for version 0.14.0
+
+ </title>
+
+ <meta name="description" content="Hey all! We’re preparing for Mahout version 0.14.0.">
+
+ <link rel="stylesheet" href="/assets/css/main.css">
+
+ <!-- Font Awesome -->
+ <link href="https://maxcdn.bootstrapcdn.com/font-awesome/4.7.0/css/font-awesome.min.css" rel="stylesheet" integrity="sha384-wvfXpqpZZVQGK6TAh5PVlGOfQNHSoD2xbE+QkPxCAFlNEevoEH3Sl0sibVcOQVnN" crossorigin="anonymous">
+
+ <!-- Google Fonts -->
+ <link href="https://fonts.googleapis.com/css?family=Maven+Pro:400,500" rel="stylesheet">
+ <link href="https://fonts.googleapis.com/css?family=Muli:400,400i,700,700i" rel="stylesheet">
+
+ <link rel="canonical" href="http://mahout.apache.org//news/2018/06/18/preping-014.html">
+ <link rel="alternate" type="application/rss+xml" title="Apache Mahout" href="/%20/feed.xml">
+
+
+</head>
+
+<script>
+ (function(i,s,o,g,r,a,m){i['GoogleAnalyticsObject']=r;i[r]=i[r]||function(){
+ (i[r].q=i[r].q||[]).push(arguments)},i[r].l=1*new Date();a=s.createElement(o),
+ m=s.getElementsByTagName(o)[0];a.async=1;a.src=g;m.parentNode.insertBefore(a,m)
+ })(window,document,'script','//www.google-analytics.com/analytics.js','ga');
+
+ ga('create', 'UA-98314020-1', 'auto');
+ ga('send', 'pageview');
+</script>
+<body>
+
+ <nav class="navbar navbar-expand-lg navbar-light bg-light navbar-mahout">
+
+ <div class="container">
+
+ <a class="navbar-brand" href="/">
+ <img src="/assets/mahout-logo-blue.svg" alt="">
+ </a>
+
+ <button class="navbar-toggler" type="button" data-toggle="collapse" data-target="#navbarSupportedContent" aria-controls="navbarSupportedContent" aria-expanded="false" aria-label="Toggle navigation">
+ <span class="navbar-toggler-icon"></span>
+ </button>
+
+ <div class="collapse navbar-collapse" id="navbarSupportedContent">
+
+ <ul class="navbar-nav ml-auto">
+
+ <!-- Download -->
+ <li class="nav-item">
+ <a class="nav-link" href="/general/downloads">Download</a>
+ </li>
+
+ <li class="nav-item">
+ <a class="nav-link" href="/docs/latest/index.html">Overview</a>
+ </li>
+
+ <!-- Developers -->
+ <li class="nav-item dropdown">
+ <a class="nav-link dropdown-toggle" href="" id="navbarDropdownMenuLink" data-toggle="dropdown" aria-haspopup="true" aria-expanded="false">Developers</a>
+ <div class="dropdown-menu" aria-labelledby="navbarDropdownMenuLink">
+ <a class="dropdown-item" href="/developers/developer-resources.html">Developer Resources</a>
+ <a class="dropdown-item" href="/developers/buildingmahout">Building Mahout from Source</a>
+ <a class="dropdown-item" href="/developers/issue-tracker">Issues Tracking (JIRA)</a>
+ <!-- <a class="dropdown-item" href="/developers/patch-check-list/">Patch Check List</a> going to github template -->
+ <!-- <a class="dropdown-item" href="/developers/reference/">References</a> a lot of overlap with books, talks, etc. page -->
+ <a class="dropdown-item" href="/developers/release-notes/">Release Notes</a>
+ <!-- <a class="dropdown-item" href="/developers/thirdparty-dependencies/">Third Party Dependencies</a> is our site the reasonable place for this? -->
+ <!-- <a class="dropdown-item" href="/developers/version-control/">Version Control</a> -->
+ <div class="dropdown-divider"></div>
+ <h6 class="dropdown-header">How To's</h6>
+ <a class="dropdown-item" href="/developers/how-to-contribute">How to Contribute</a>
+ <a class="dropdown-item" href="/developers/githubPRs">Github PRs</a>
+ <a class="dropdown-item" href="/developers/how-to-become-a-committer">How to Become a Committer</a>
+ <a class="dropdown-item" href="/developers/how-to-release">How to Release</a>
+ <a class="dropdown-item" href="/developers/how-to-update-the-website">How to Update the Website</a>
+ </div>
+ </li>
+
+ <!-- Docs -->
+ <li class="nav-item dropdown">
+ <a class="nav-link dropdown-toggle" href="" id="navbarDropdownMenuLink" data-toggle="dropdown" aria-haspopup="true" aria-expanded="false">Docs</a>
+ <div class="dropdown-menu" aria-labelledby="navbarDropdownMenuLink">
+ <h6 class="dropdown-header">Release</h6>
+ <a class="dropdown-item" href="/docs/0.13.0">0.13.0</a>
+ <div class="dropdown-divider"></div>
+ <h6 class="dropdown-header">Latest Snapshot (Development)</h6>
+ <a class="dropdown-item" href="/docs/latest">0.13.1-SNAPSHOT</a>
+ </div>
+ </li>
+
+ <!-- Community -->
+ <li class="nav-item dropdown">
+ <a class="nav-link dropdown-toggle" href="" id="navbarDropdownMenuLink" data-toggle="dropdown" aria-haspopup="true" aria-expanded="false">Community</a>
+ <div class="dropdown-menu dropdown-menu-right">
+ <!--<a class="dropdown-item" href="/community/history/">History of the Apache Mahout Project</a>-->
+ <!--<a class="dropdown-item" href="/community/blogs/">Blog Posts About Mahout</a>-->
+ <!--<a class="dropdown-item" href="/community/recent-upcoming-talks/">Recent and Upcoming Talks</a>-->
+ <!-- <a class="dropdown-item" href="/community/books-tutorials-and-talks/">Books Tutorials and Talks</a> -->
+ <!-- <a class="dropdown-item" href="/community/faq/">FAQ</a> needs a lot of updating -->
+ <a class="dropdown-item" href="/developers/gsoc">GSoC</a>
+ <!-- Is OK- updated Map/Reduce verbage to reflect Samsara -->
+ <!-- <a class="dropdown-item" href="/community/mahout-benchmarks/">Mahout Benchmarks</a> These are old, Keep them? -->
+ <!-- <a class="dropdown-item" href="/community/mahout-wiki/">Mahout Wiki</a> at very least needs links cleanedup - do we still want this even?-->
+ <a class="dropdown-item" href="/general/mailing-lists">Mailing Lists</a>
+ <!-- Clean and pretty -->
+ <!-- <a class="dropdown-item" href="/community/powered-by-mahout/">Powered By Mahout</a> needs update -->
+ <a class="dropdown-item" href="/general/privacy-policy">Privacy Policy</a>
+ <!-- <a class="dropdown-item" href="/community/professional-support/">Professional Support</a> update if we even want to keep -->
+ <a class="dropdown-item" href="/general/who-we-are">Who We Are</a>
+ <!-- nikolai needs to add himself -->
+ </div>
+ </li>
+
+ <!-- GitHub -->
+ <li class="nav-item">
+ <a class="nav-link" href="http://github.com/apache/mahout"><i class="fa fa-github"></i></a>
+ </li>
+
+ </ul>
+
+ <!-- <form class="form-inline my-2 my-lg-0">
+ <input class="form-control mr-sm-2" type="text" placeholder="Search" aria-label="Search">
+ <button class="btn btn-outline-success my-2 my-sm-0" type="submit">Search</button>
+ </form> -->
+
+ </div>
+
+ </div>
+
+</nav>
+
+
+ <div class="container mt-5 pb-4">
+
+ <article class="post" itemscope="itemscope" itemtype="http://schema.org/BlogPosting">
+
+ <header class="post-header">
+ <h1 class="post-title" itemprop="name headline">Preparing for version 0.14.0</h1>
+ <p class="post-meta">
+ <time datetime="2018-06-18T22:16:01+00:00" itemprop="datePublished">
+ 2018-06-18 22:16:01 +0000
+ </time>
+
+ </p>
+ </header>
+
+ <div class="post-content" itemprop="articleBody">
+ <p>Hey all! We’re preparing for Mahout version 0.14.0.</p>
+
+<p>This release might not seem super exciting from a user perspective (except
+we’re bumping Apache Spark to version 2.x and Scala to version 2.11) but
+it includes some very important ‘behind the scenes changes’. We’re doing a
+heavy refactoring which will combine <code class="highlighter-rouge">math</code> and <code class="highlighter-rouge">math-scala</code> into a module
+called <code class="highlighter-rouge">core</code> (no change to code required!).</p>
+
+<p>We will be moving H2o and Flink-batch engines into <code class="highlighter-rouge">community</code> that is to
+say the code will still be available, but we won’t be actively maintaining
+it (we will accept PRs though). We’re moving the Map Reduce code to community
+and have considered to once again accepting PRs against this code (if you have
+an open PR, we’ll hit you up after the release about rebasing and considering to
+resubmit the PR)</p>
+
+<p>We’re going to be updating the Java dependency from 1.7 to 1.8.</p>
+
+<p>There was a LOT of old junk in the POMs which we have clipped out.</p>
+
+<p>Stay tuned and <a href="https://issues.apache.org/jira/secure/RapidBoard.jspa?rapidView=80&projectKey=MAHOUT&view=detail&selectedIssue=MAHOUT-2047">feel free to help out</a>!</p>
+
+
+ </div>
+
+ </article>
+
+</div>
+
+
+ <footer class="footer bg-light">
+ <div class="container text-center small">
+ Copyright © 2014-2018 The Apache Software Foundation, Licensed under the Apache License, Version 2.0.
+ </div>
+</footer>
+
+ <script src="/assets/vendor/jquery/jquery-slim.min.js"></script>
+ <script src="/assets/vendor/popper/popper.min.js"></script>
+ <script src="/assets/vendor/bootstrap/js/bootstrap.min.js"></script>
+ <script src="/assets/header.js"></script>
+ <script src="https://cdnjs.cloudflare.com/ajax/libs/mathjax/2.7.0/MathJax.js?config=TeX-AMS-MML_HTMLorMML" type="text/javascript"></script>
+
+</body>
+
+</html>
http://git-wip-us.apache.org/repos/asf/mahout/blob/7aee324d/users/algorithms/recommender-overview.html
----------------------------------------------------------------------
diff --git a/users/algorithms/recommender-overview.html b/users/algorithms/recommender-overview.html
index 6cbdb2b..a792859 100644
--- a/users/algorithms/recommender-overview.html
+++ b/users/algorithms/recommender-overview.html
@@ -142,7 +142,15 @@
<h1 id="recommender-overview">Recommender Overview</h1>
-<p>Recommenders have changed over the years. Mahout contains a long list of them, which you can still use. But to get the best out of our more modern aproach we’ll need to think of the Recommender as a “model creation” component—supplied by Mahout’s new spark-itemsimilarity job, and a “serving” component—supplied by a modern scalable search engine, like Solr.</p>
+<p>Recommenders have changed over the years. Mahout contains a long list of them, which you can still use. However in about 2013 there was a revolution in recommenders, which favored what we might call “Multimodal”, meaning they could take in data of all sorts—basically anything we might think was an indicator of user taste. The new Samsara algorithm, called Correlated Cross-Occurrence (CCO) is just such a next gen recommender algorithm but Mahout-Samsara only implements the model building part. This can be integrated as the user see fit and the rest of this doc will explain how.</p>
+
+<h2 id="turnkey-implementation">Turnkey Implementation</h2>
+
+<p>If you are looking for an end-to-end OSS recommender based on the Mahout CCO algorithm have a look at <a href="https://github.com/actionml/universal-recommender">The Universal Recommender</a>, which is implemented using <a href="http://predictionio.apache.org/">Apache PredictionIO</a>. See instructions for <a href="http://actionml.com/docs/pio_by_actionml">installation here</a>. There is even an AWS AMI for convenience (this is a for-pay option)</p>
+
+<h2 id="build-your-own-integration">Build Your Own Integration</h2>
+
+<p>To get the most out of our more modern CCO algorithm we’ll need to think of the Recommender as a “model creation” component—supplied by Mahout’s new spark-itemsimilarity job, and a “serving” component—supplied by a modern scalable search engine, like Solr or Elasticsearch. Here we describe a loose integration that does not require using Mahout as a library, it uses Mahout’s command line interface. This is clearly not the best but allows one to experiments and get a real recommender running easily.</p>
<p><img src="Loading Image...
@@ -154,7 +162,7 @@
<p>All ids for users and items are preserved as string tokens and so work as an external key in DBs or as doc ids for search engines, they also work as tokens for search queries.</p>
-<p>##References</p>
+<h2 id="references">References</h2>
<ol>
<li>A free ebook, which talks about the general idea: <a href="https://www.mapr.com/practical-machine-learning">Practical Machine Learning</a></li>
@@ -164,7 +172,7 @@ and <a href="http://occamsmachete.com/ml/2014/09/09/mahout-on-spark-whats-new-i
<li>A post describing the loglikelihood ratio: <a href="http://tdunning.blogspot.com/2008/03/surprise-and-coincidence.html">Surprise and Coinsidense</a> LLR is used to reduce noise in the data while keeping the calculations O(n) complexity.</li>
</ol>
-<p>##Mahout Model Creation</p>
+<h2 id="mahout-model-creation">Mahout Model Creation</h2>
<p>See the page describing <a href="http://mahout.apache.org/users/recommender/intro-cooccurrence-spark.html"><em>spark-itemsimilarity</em></a> for more details.</p>