public class ClusterSequences extends Object implements Serializable
| Constructor and Description |
|---|
ClusterSequences(int nrFractions,
Float minOverlap1,
Float minOverlap2,
Float minPercentageId) |
| Modifier and Type | Method and Description |
|---|---|
org.apache.spark.api.java.JavaPairRDD<String,Iterable<String>> |
clusterSequences(org.apache.spark.api.java.JavaSparkContext sc,
org.apache.spark.api.java.JavaPairRDD<String,String> sequences,
int nrIterations)
passes in a JavaPairRDD where the first element is the ID of a sequence, the second one the string representation
|
public org.apache.spark.api.java.JavaPairRDD<String,Iterable<String>> clusterSequences(org.apache.spark.api.java.JavaSparkContext sc, org.apache.spark.api.java.JavaPairRDD<String,String> sequences, int nrIterations)
sc - Java spark contextsequences - the sequences represented as sequence ID and sequence string..nrIterations - how many iterations of clustering should be performed. if < 1 nothing will get run.Copyright © 2016 Biojava. All Rights Reserved.