A B C D E F G H I J L M N O P Q R S T U V W Z

A

abortTask(TaskAttemptContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.ProxyOutputFormat.ProxyOutputCommitter
 
AbstractBaseTest - Class in com.datasalt.pangool.utils.test
 
AbstractBaseTest() - Constructor for class com.datasalt.pangool.utils.test.AbstractBaseTest
 
AbstractHadoopTestLibrary - Class in com.datasalt.pangool.utils.test
Niceties and utilities for making Hadoop unit tests less painfully.
AbstractHadoopTestLibrary() - Constructor for class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary
 
AbstractHadoopTestLibrary.PrintVisitor - Class in com.datasalt.pangool.utils.test
 
AbstractHadoopTestLibrary.PrintVisitor() - Constructor for class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary.PrintVisitor
 
AbstractHadoopTestLibrary.TupleVisitor - Class in com.datasalt.pangool.utils.test
 
AbstractHadoopTestLibrary.TupleVisitor() - Constructor for class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary.TupleVisitor
 
accept(Class<?>) - Method in class com.datasalt.pangool.serialization.ProtoStuffSerialization
 
accept(Class<?>) - Method in class com.datasalt.pangool.serialization.ThriftSerialization
 
accept(Class<?>) - Method in class com.datasalt.pangool.tuplemr.serialization.AvroFieldSerialization
 
accept(Class<?>) - Method in class com.datasalt.pangool.tuplemr.serialization.TupleFieldSerialization
 
accept(Class<?>) - Method in class com.datasalt.pangool.tuplemr.serialization.TupleSerialization
 
add(String, String) - Method in class com.datasalt.pangool.tuplemr.Aliases
Adds an alias
add(NamedOutputsInterface.Output) - Method in class com.datasalt.pangool.tuplemr.NamedOutputsInterface
 
add(String, Criteria.Order) - Method in class com.datasalt.pangool.tuplemr.OrderBy
Adds a new field to order by and its specified order.
add(String, Criteria.Order, Criteria.NullOrder) - Method in class com.datasalt.pangool.tuplemr.OrderBy
Adds a new field to order by and its specified order.
add(String, Criteria.Order, Criteria.NullOrder, RawComparator<?>) - Method in class com.datasalt.pangool.tuplemr.OrderBy
Same as OrderBy#add(String, Order) but adding the possibility to specify a custom comparator for that field.
addAvroSerialization(Configuration) - Static method in class com.datasalt.pangool.utils.AvroUtils
 
addClass(String, Class, String) - Method in class com.datasalt.pangool.PangoolDriver
This is the method that adds the classed to the repository
addInput(Path, InputFormat, MapOnlyMapper) - Method in class com.datasalt.pangool.tuplemr.MapOnlyJobBuilder
 
addInput(Path, InputFormat, MapOnlyMapper, Map<String, String>) - Method in class com.datasalt.pangool.tuplemr.MapOnlyJobBuilder
 
addInput(Path, InputFormat, TupleMapper) - Method in class com.datasalt.pangool.tuplemr.TupleMRBuilder
Defines an input as in PangoolMultipleInputs
addInput(Path, InputFormat, TupleMapper, Map<String, String>) - Method in class com.datasalt.pangool.tuplemr.TupleMRBuilder
 
addInputContext(Job, String, String, String) - Static method in class com.datasalt.pangool.tuplemr.mapred.lib.input.PangoolMultipleInputs
Specific (key, value) configurations for each Input.
addInputPath(Job, Path, InputFormat, Mapper, Map<String, String>) - Static method in class com.datasalt.pangool.tuplemr.mapred.lib.input.PangoolMultipleInputs
Add a Path with a custom InputFormat and Mapper to the list of inputs for the map-reduce job.
addIntermediateSchema(Schema) - Method in class com.datasalt.pangool.tuplemr.TupleMRConfigBuilder
Adds a Map-output schema.
addNamedOutput(String, OutputFormat, Class, Class) - Method in class com.datasalt.pangool.tuplemr.MapOnlyJobBuilder
 
addNamedOutput(String, OutputFormat, Class, Class, Map<String, String>) - Method in class com.datasalt.pangool.tuplemr.MapOnlyJobBuilder
 
addNamedOutput(Job, String, OutputFormat, Class<?>, Class<?>) - Static method in class com.datasalt.pangool.tuplemr.mapred.lib.output.PangoolMultipleOutputs
Adds a named output for the job.
addNamedOutput(String, OutputFormat, Class, Class) - Method in class com.datasalt.pangool.tuplemr.TupleMRBuilder
 
addNamedOutput(String, OutputFormat, Class, Class, Map<String, String>) - Method in class com.datasalt.pangool.tuplemr.TupleMRBuilder
 
addNamedOutputContext(Job, String, String, String) - Static method in class com.datasalt.pangool.tuplemr.mapred.lib.output.PangoolMultipleOutputs
Added this method for allowing specific (key, value) configurations for each Output.
addNamedTupleOutput(String, Schema) - Method in class com.datasalt.pangool.tuplemr.MapOnlyJobBuilder
 
addNamedTupleOutput(String, Schema) - Method in class com.datasalt.pangool.tuplemr.TupleMRBuilder
 
addProp(String, String) - Method in class com.datasalt.pangool.io.Schema.Field
 
addReducerContext(Reducer.Context) - Static method in class com.datasalt.pangool.solr.SolrRecordWriter
 
addSchemaOrder(Criteria.Order) - Method in class com.datasalt.pangool.tuplemr.OrderBy
This method,unlike the traditional OrderBy#add(String, Order, NullOrder, RawComparator) method, adds a symbolic elements to order by.
addTupleInput(Path, MapOnlyMapper) - Method in class com.datasalt.pangool.tuplemr.MapOnlyJobBuilder
Adds an input file associated with a TupleFile.
addTupleInput(Path, Schema, MapOnlyMapper) - Method in class com.datasalt.pangool.tuplemr.MapOnlyJobBuilder
Adds an input file associated with a TupleFile.
addTupleInput(Path, TupleMapper<ITuple, NullWritable>) - Method in class com.datasalt.pangool.tuplemr.TupleMRBuilder
Adds an input file associated with a TupleFile.
addTupleInput(Path, Schema, TupleMapper<ITuple, NullWritable>) - Method in class com.datasalt.pangool.tuplemr.TupleMRBuilder
Adds an input file associated with a TupleFile.
Aliases - Class in com.datasalt.pangool.tuplemr
Builder that contains pairs of (alias, referenced_item).
Aliases(Map<String, String>) - Constructor for class com.datasalt.pangool.tuplemr.Aliases
 
Aliases() - Constructor for class com.datasalt.pangool.tuplemr.Aliases
 
allowedConfigDirectories - Static variable in class com.datasalt.pangool.solr.SolrRecordWriter
 
append(ITuple) - Method in class com.datasalt.pangool.io.TupleFile.Writer
Append a ITuple
assertRun(Job) - Method in class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary
 
AvroBinaryComparator - Class in com.datasalt.pangool.utils
 
AvroBinaryComparator(Schema) - Constructor for class com.datasalt.pangool.utils.AvroBinaryComparator
 
AvroFieldSerialization<T> - Class in com.datasalt.pangool.tuplemr.serialization
 
AvroFieldSerialization() - Constructor for class com.datasalt.pangool.tuplemr.serialization.AvroFieldSerialization
 
AvroFieldSerialization.AvroFieldDeserializer<T> - Class in com.datasalt.pangool.tuplemr.serialization
 
AvroFieldSerialization.AvroFieldDeserializer(Schema, boolean) - Constructor for class com.datasalt.pangool.tuplemr.serialization.AvroFieldSerialization.AvroFieldDeserializer
 
AvroFieldSerialization.AvroFieldSerializer<T> - Class in com.datasalt.pangool.tuplemr.serialization
 
AvroFieldSerialization.AvroFieldSerializer(Schema, boolean) - Constructor for class com.datasalt.pangool.tuplemr.serialization.AvroFieldSerialization.AvroFieldSerializer
 
AvroInputFormat<T> - Class in com.datasalt.pangool.tuplemr.avro
This is a Pangool's implementation of AvroInputFormat.
AvroInputFormat(Schema) - Constructor for class com.datasalt.pangool.tuplemr.avro.AvroInputFormat
 
AvroInputFormat(Schema, boolean) - Constructor for class com.datasalt.pangool.tuplemr.avro.AvroInputFormat
 
AvroInputFormat<T> - Class in org.apache.avro.mapreduce.lib.input
An InputFormat for Avro data files
AvroInputFormat() - Constructor for class org.apache.avro.mapreduce.lib.input.AvroInputFormat
 
AvroOutputFormat<T> - Class in com.datasalt.pangool.tuplemr.avro
This is the Pangool's version of AvroOutputFormat.
AvroOutputFormat(Schema) - Constructor for class com.datasalt.pangool.tuplemr.avro.AvroOutputFormat
 
AvroOutputFormat(Schema, String) - Constructor for class com.datasalt.pangool.tuplemr.avro.AvroOutputFormat
 
AvroOutputFormat(Schema, String, int) - Constructor for class com.datasalt.pangool.tuplemr.avro.AvroOutputFormat
 
AvroOutputFormat<T> - Class in org.apache.avro.mapreduce.lib.output
An OutputFormat for Avro data files.
AvroOutputFormat() - Constructor for class org.apache.avro.mapreduce.lib.output.AvroOutputFormat
 
AvroRecordReader<T> - Class in com.datasalt.pangool.tuplemr.avro
This is the pangool's implementation of AvroRecordReader.
AvroRecordReader(Schema, boolean, Configuration, FileSplit) - Constructor for class com.datasalt.pangool.tuplemr.avro.AvroRecordReader
 
AvroRecordReader<T> - Class in org.apache.avro.mapreduce.lib.input
An RecordReader for Avro data files.
AvroRecordReader(Configuration, FileSplit) - Constructor for class org.apache.avro.mapreduce.lib.input.AvroRecordReader
 
AvroUtils - Class in com.datasalt.pangool.utils
 
AvroUtils() - Constructor for class com.datasalt.pangool.utils.AvroUtils
 

B

baseDir - Variable in class com.datasalt.pangool.tuplemr.mapred.lib.output.ProxyOutputFormat
 
BatchWriter - Class in com.datasalt.pangool.solr
Enables adding batches of documents to an EmbeddedSolrServer.
BatchWriter(EmbeddedSolrServer, int, TaskID, int, int) - Constructor for class com.datasalt.pangool.solr.BatchWriter
 
BitField - Class in com.datasalt.pangool.io
Bit Field class.
BitField() - Constructor for class com.datasalt.pangool.io.BitField
 
BitField.BitFieldComparator - Class in com.datasalt.pangool.io
Simple lexicographic BitField comparator
BitField.BitFieldComparator() - Constructor for class com.datasalt.pangool.io.BitField.BitFieldComparator
 
bitOnByte(int, int) - Static method in class com.datasalt.pangool.io.BitField
 
Buffer - Class in com.datasalt.pangool.utils
 
Buffer() - Constructor for class com.datasalt.pangool.utils.Buffer
 
Buffer(int) - Constructor for class com.datasalt.pangool.utils.Buffer
 
buildConf() - Method in class com.datasalt.pangool.tuplemr.TupleMRConfigBuilder
Creates a brand new and immutable TupleMRConfig instance.
byteBitSet(int, byte) - Static method in class com.datasalt.pangool.io.BitField
 
byteBitUnset(int, byte) - Static method in class com.datasalt.pangool.io.BitField
 
byteForBit(int) - Static method in class com.datasalt.pangool.io.BitField
Minimum size of the backing array needed for setting the given bit
BytesWritable - Class in com.datasalt.pangool.io
A byte sequence that is usable as a key or value.
BytesWritable() - Constructor for class com.datasalt.pangool.io.BytesWritable
Create a zero-size sequence.
BytesWritable(byte[]) - Constructor for class com.datasalt.pangool.io.BytesWritable
Create a BytesWritable using the byte array as the initial value.
BytesWritable.Comparator - Class in com.datasalt.pangool.io
A Comparator optimized for BytesWritable.
BytesWritable.Comparator() - Constructor for class com.datasalt.pangool.io.BytesWritable.Comparator
 

C

calculateRollupBaseFields() - Method in class com.datasalt.pangool.tuplemr.TupleMRConfig
Returns the fields that are a subset from the groupBy fields and will be used when rollup is needed.
cancelHeartBeat() - Method in class com.datasalt.pangool.solr.HeartBeater
inform the background thread that this heartbeat request is not needed.
CascadingTupleInputFormat - Class in com.datasalt.pangool.tuplemr.mapred.lib.input
A wrapper around a SequenceFile that contains Cascading's Tuples that implements a Pangool-friendly InputFormat.
CascadingTupleInputFormat(String, String...) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.input.CascadingTupleInputFormat
 
checkFieldType(Schema.Field) - Method in class com.datasalt.pangool.solr.DefaultTupleDocumentConverter
 
checkOutputSpecs(JobContext) - Method in class com.datasalt.pangool.solr.TupleSolrOutputFormat
 
checkOutputSpecs(JobContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.HadoopOutputFormat
 
checkOutputSpecs(JobContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.ProxyOutputFormat
 
cleanup(Mapper<I1, I2, O1, O2>.Context, MultipleOutputsCollector) - Method in class com.datasalt.pangool.tuplemr.mapred.MapOnlyMapper
Override this method, not the other declared as final.
cleanup(Mapper<I1, I2, O1, O2>.Context) - Method in class com.datasalt.pangool.tuplemr.mapred.MapOnlyMapper
 
cleanup(Reducer<DatumWrapper<ITuple>, NullWritable, OUTPUT_KEY, OUTPUT_VALUE>.Context) - Method in class com.datasalt.pangool.tuplemr.mapred.RollupReducer
 
cleanup(Reducer<DatumWrapper<ITuple>, NullWritable, DatumWrapper<ITuple>, NullWritable>.Context) - Method in class com.datasalt.pangool.tuplemr.mapred.SimpleCombiner
 
cleanup(Reducer<DatumWrapper<ITuple>, NullWritable, OUTPUT_KEY, OUTPUT_VALUE>.Context) - Method in class com.datasalt.pangool.tuplemr.mapred.SimpleReducer
 
cleanup(TupleMapper<INPUT_KEY, INPUT_VALUE>.TupleMRContext, TupleMapper.Collector) - Method in class com.datasalt.pangool.tuplemr.TupleMapper
Called once at the end of the task.
cleanup(Mapper<INPUT_KEY, INPUT_VALUE, DatumWrapper<ITuple>, NullWritable>.Context) - Method in class com.datasalt.pangool.tuplemr.TupleMapper
Do not override.
cleanup(TupleReducer.TupleMRContext, TupleReducer<OUTPUT_KEY, OUTPUT_VALUE>.Collector) - Method in class com.datasalt.pangool.tuplemr.TupleReducer
 
cleanUp() - Method in class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary
 
cleanUpInstanceFiles() - Method in class com.datasalt.pangool.tuplemr.MapOnlyJobBuilder
Run this method after running your Job for instance files to be properly cleaned.
cleanUpInstanceFiles() - Method in class com.datasalt.pangool.tuplemr.TupleMRBuilder
Run this method after running your Job for instance files to be properly cleaned.
clear() - Method in class com.datasalt.pangool.io.BitField
Clears the bit field, unsetting all bits.
clear() - Method in interface com.datasalt.pangool.io.ITuple
 
clear() - Method in class com.datasalt.pangool.io.Tuple
 
clear() - Method in class com.datasalt.pangool.io.ViewTuple
 
cloneField(Schema.Field, String, boolean) - Static method in class com.datasalt.pangool.io.Schema.Field
Clones a Field with a new name.
cloneField(Schema.Field, String) - Static method in class com.datasalt.pangool.io.Schema.Field
Clones a Field with a new name.
close() - Method in class com.datasalt.pangool.io.TupleFile.Reader
Close the file.
close() - Method in class com.datasalt.pangool.io.TupleFile.Writer
Close the file.
close() - Method in class com.datasalt.pangool.serialization.ProtoStuffSerialization.ProtoStuffDeserializer
 
close() - Method in class com.datasalt.pangool.serialization.ProtoStuffSerialization.ProtoStuffSerializer
 
close() - Method in class com.datasalt.pangool.serialization.ThriftSerialization.TDeserializerAdapter
 
close() - Method in class com.datasalt.pangool.serialization.ThriftSerialization.TSerializerAdapter
 
close(TaskAttemptContext, SolrCore) - Method in class com.datasalt.pangool.solr.BatchWriter
 
close(TaskAttemptContext) - Method in class com.datasalt.pangool.solr.SolrRecordWriter
 
close() - Method in class com.datasalt.pangool.tuplemr.avro.AvroRecordReader
 
close() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.DelegatingRecordReader
 
close() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleFileRecordReader
 
close() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat.TupleTextInputReader
 
close() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.PangoolMultipleOutputs
Closes all the opened outputs.
close() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.TupleTextOutputFormat.CustomCSVWriter
 
close(TaskAttemptContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.TupleTextOutputFormat.TupleTextRecordWriter
 
close() - Method in class com.datasalt.pangool.tuplemr.MultipleOutputsCollector
 
close() - Method in class com.datasalt.pangool.tuplemr.serialization.AvroFieldSerialization.AvroFieldDeserializer
 
close() - Method in class com.datasalt.pangool.tuplemr.serialization.AvroFieldSerialization.AvroFieldSerializer
 
close() - Method in class com.datasalt.pangool.tuplemr.serialization.SimpleTupleDeserializer
 
close() - Method in class com.datasalt.pangool.tuplemr.serialization.SimpleTupleSerializer
 
close() - Method in class com.datasalt.pangool.tuplemr.serialization.TupleDeserializer
 
close() - Method in class com.datasalt.pangool.tuplemr.serialization.TupleSerializer
 
close() - Method in class org.apache.avro.mapreduce.lib.input.AvroRecordReader
 
CollectionInputFormat<K,V> - Class in com.datasalt.pangool.utils.test
A base input format for use a Collection as input for a MapReduce Job.
CollectionInputFormat() - Constructor for class com.datasalt.pangool.utils.test.CollectionInputFormat
 
CollectionInputFormat.CustomInputSplit - Class in com.datasalt.pangool.utils.test
 
CollectionInputFormat.CustomInputSplit() - Constructor for class com.datasalt.pangool.utils.test.CollectionInputFormat.CustomInputSplit
 
CollectionInputFormat.Duple<K,V> - Class in com.datasalt.pangool.utils.test
 
CollectionInputFormat.Duple(K, V) - Constructor for class com.datasalt.pangool.utils.test.CollectionInputFormat.Duple
 
com.datasalt.pangool - package com.datasalt.pangool
 
com.datasalt.pangool.io - package com.datasalt.pangool.io
 
com.datasalt.pangool.pig - package com.datasalt.pangool.pig
 
com.datasalt.pangool.serialization - package com.datasalt.pangool.serialization
 
com.datasalt.pangool.solr - package com.datasalt.pangool.solr
 
com.datasalt.pangool.tuplemr - package com.datasalt.pangool.tuplemr
 
com.datasalt.pangool.tuplemr.avro - package com.datasalt.pangool.tuplemr.avro
 
com.datasalt.pangool.tuplemr.mapred - package com.datasalt.pangool.tuplemr.mapred
 
com.datasalt.pangool.tuplemr.mapred.lib.input - package com.datasalt.pangool.tuplemr.mapred.lib.input
 
com.datasalt.pangool.tuplemr.mapred.lib.output - package com.datasalt.pangool.tuplemr.mapred.lib.output
 
com.datasalt.pangool.tuplemr.serialization - package com.datasalt.pangool.tuplemr.serialization
 
com.datasalt.pangool.utils - package com.datasalt.pangool.utils
 
com.datasalt.pangool.utils.test - package com.datasalt.pangool.utils.test
 
commitTask(TaskAttemptContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.ProxyOutputFormat.ProxyOutputCommitter
 
CommonUtils - Class in com.datasalt.pangool.utils
 
CommonUtils() - Constructor for class com.datasalt.pangool.utils.CommonUtils
 
compare(BitField, BitField) - Method in class com.datasalt.pangool.io.BitField.BitFieldComparator
 
compare(byte[], int, int, byte[], int, int) - Method in class com.datasalt.pangool.io.BytesWritable.Comparator
Compare the buffers in serialized form.
compare(ITuple, ITuple) - Method in class com.datasalt.pangool.tuplemr.mapred.GroupComparator
Never called in MapRed jobs.
compare(byte[], int, int, byte[], int, int) - Method in class com.datasalt.pangool.tuplemr.mapred.GroupComparator
 
compare(Object, Serializer, Object, Serializer) - Method in class com.datasalt.pangool.tuplemr.mapred.SerializerComparator
 
compare(Object, Object) - Method in class com.datasalt.pangool.tuplemr.mapred.SerializerComparator
 
compare(ITuple, ITuple) - Method in class com.datasalt.pangool.tuplemr.mapred.SortComparator
Never called in MapRed jobs.
compare(Schema, Criteria, ITuple, int[], ITuple, int[], Serializer[]) - Method in class com.datasalt.pangool.tuplemr.mapred.SortComparator
 
compare(byte[], int, int, byte[], int, int) - Method in class com.datasalt.pangool.tuplemr.mapred.SortComparator
 
compare(byte[], int, byte[], int, Schema, Criteria, SortComparator.Offsets, SortComparator.Nulls) - Method in class com.datasalt.pangool.tuplemr.mapred.SortComparator
 
compare(Object, Object) - Method in class com.datasalt.pangool.utils.AvroBinaryComparator
 
compare(byte[], int, int, byte[], int, int) - Method in class com.datasalt.pangool.utils.AvroBinaryComparator
 
compareMultipleSources(byte[], int, int, byte[], int, int) - Method in class com.datasalt.pangool.tuplemr.mapred.SortComparator
 
compareObjects(Object, Object, RawComparator, Schema.Field.Type, Serializer) - Method in class com.datasalt.pangool.tuplemr.mapred.SortComparator
Compares two objects.
compareObjects(Object, Object) - Static method in class com.datasalt.pangool.tuplemr.mapred.SortComparator
 
compareTo(BitField) - Method in class com.datasalt.pangool.io.BitField
 
conf - Variable in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleFileRecordReader
 
conf - Variable in class com.datasalt.pangool.tuplemr.mapred.lib.output.ProxyOutputFormat
 
conf - Variable in class com.datasalt.pangool.tuplemr.mapred.SerializerComparator
 
conf - Variable in class com.datasalt.pangool.tuplemr.mapred.SortComparator
 
CONF_COMBINER_HANDLER - Static variable in class com.datasalt.pangool.tuplemr.mapred.SimpleCombiner
 
CONF_COMPARATOR_INSTANCES - Static variable in class com.datasalt.pangool.tuplemr.TupleMRConfig
 
CONF_COMPARATOR_REFERENCES - Static variable in class com.datasalt.pangool.tuplemr.TupleMRConfig
 
CONF_REDUCER_HANDLER - Static variable in class com.datasalt.pangool.tuplemr.mapred.SimpleReducer
 
CONF_SCHEMA_VALIDATION - Static variable in class com.datasalt.pangool.tuplemr.serialization.TupleSerialization
Configuration parameter to enable the Schema strict validation.
When schema validation is set, the schema of the tuples emitted through the TupleMapper collector or TupleOutputFormat are validated, i.e.
configureJob(Job) - Method in class com.datasalt.pangool.tuplemr.MultipleInputsInterface
Use this method for configuring a Job instance according to the multiple input specs that has been specified.
configureJob(Job) - Method in class com.datasalt.pangool.tuplemr.NamedOutputsInterface
Use this method for configuring a Job instance according to the named outputs specs that has been specified.
containsBeforeSchemaOrder(String) - Method in class com.datasalt.pangool.tuplemr.OrderBy
True if field was added before calling #addSchemaOrder(Order)
containsField(String) - Method in class com.datasalt.pangool.io.Schema
 
containsFieldName(String) - Method in class com.datasalt.pangool.tuplemr.OrderBy
True if field was added using #add(String, Order)
containsFieldUsingAlias(Schema, String, Map<String, String>) - Static method in class com.datasalt.pangool.io.Schema
 
containsNullableFields() - Method in class com.datasalt.pangool.io.Schema
 
context - Variable in class com.datasalt.pangool.tuplemr.TupleReducer.StaticCollector
 
convert(ITuple, NullWritable) - Method in class com.datasalt.pangool.solr.DefaultTupleDocumentConverter
 
convert(ITuple, NullWritable) - Method in interface com.datasalt.pangool.solr.TupleDocumentConverter
 
COUNTER - Static variable in class com.datasalt.pangool.io.Mutator
 
create(String, Schema.Field.Type, boolean, Object) - Static method in class com.datasalt.pangool.io.Schema.Field
Crates a field of the given type.
create(String, Schema.Field.Type, boolean) - Static method in class com.datasalt.pangool.io.Schema.Field
Crates a field of the given type.
create(String, Schema.Field.Type) - Static method in class com.datasalt.pangool.io.Schema.Field
Crates a non nullable field of the given type.
createAvroField(String, Schema) - Static method in class com.datasalt.pangool.io.Fields
 
createAvroField(String, Schema, boolean) - Static method in class com.datasalt.pangool.io.Fields
Creates a field containing an Avro object that will be serialized using AvroFieldSerialization
createConf() - Method in class com.datasalt.pangool.utils.test.AbstractBaseTest
 
createEnum(String, Class<?>, boolean, Object) - Static method in class com.datasalt.pangool.io.Schema.Field
Creates an enum field, based in a enum class
createEnum(String, Class<?>, boolean) - Static method in class com.datasalt.pangool.io.Schema.Field
Creates an enum field, based in a enum class
createEnum(String, Class<?>) - Static method in class com.datasalt.pangool.io.Schema.Field
Creates a non-nullable enum field, based in a enum class
createJob() - Method in class com.datasalt.pangool.tuplemr.MapOnlyJobBuilder
 
createJob() - Method in class com.datasalt.pangool.tuplemr.TupleMRBuilder
 
createNewConfiguration() - Static method in class com.datasalt.pangool.utils.test.AbstractBaseTest
 
createObject(String, Class<?>, boolean) - Static method in class com.datasalt.pangool.io.Schema.Field
Creates an object field.
createObject(String, Class<?>) - Static method in class com.datasalt.pangool.io.Schema.Field
Creates a non nullable object field.
createRecordReader(InputSplit, TaskAttemptContext) - Method in class com.datasalt.pangool.tuplemr.avro.AvroInputFormat
 
createRecordReader(InputSplit, TaskAttemptContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.CascadingTupleInputFormat
 
createRecordReader(InputSplit, TaskAttemptContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.DelegatingInputFormat
 
createRecordReader(InputSplit, TaskAttemptContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.HadoopInputFormat
 
createRecordReader(InputSplit, TaskAttemptContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.HCatTupleInputFormat
 
createRecordReader(InputSplit, TaskAttemptContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleInputFormat
 
createRecordReader(InputSplit, TaskAttemptContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat
 
createRecordReader(InputSplit, TaskAttemptContext) - Method in class com.datasalt.pangool.utils.test.CollectionInputFormat
 
createRecordReader(InputSplit, TaskAttemptContext) - Method in class org.apache.avro.mapreduce.lib.input.AvroInputFormat
 
createTupleField(String, Schema) - Static method in class com.datasalt.pangool.io.Fields
Deprecated. Use Schema.Field.createTupleField(String, Schema) instead}
createTupleField(String, Schema, boolean) - Static method in class com.datasalt.pangool.io.Schema.Field
Creates a field containing a Pangool Tuple.
createTupleField(String, Schema) - Static method in class com.datasalt.pangool.io.Schema.Field
Creates a non-nullable field containing a Pangool Tuple.
Criteria - Class in com.datasalt.pangool.tuplemr
Criteria specifies a sorting criteria defined by a list of Criteria.SortElement instances that contain a field name and its corresponding order defined by Criteria.Order.
Criteria(List<Criteria.SortElement>) - Constructor for class com.datasalt.pangool.tuplemr.Criteria
 
Criteria() - Constructor for class com.datasalt.pangool.tuplemr.Criteria
 
Criteria.NullOrder - Enum in com.datasalt.pangool.tuplemr
 
Criteria.Order - Enum in com.datasalt.pangool.tuplemr
 
Criteria.SortElement - Class in com.datasalt.pangool.tuplemr
Basic Criteria element.
Criteria.SortElement(String, Criteria.Order, Criteria.NullOrder) - Constructor for class com.datasalt.pangool.tuplemr.Criteria.SortElement
 
Criteria.SortElement(String, Criteria.Order, Criteria.NullOrder, RawComparator<?>) - Constructor for class com.datasalt.pangool.tuplemr.Criteria.SortElement
 

D

dataToServe() - Method in class com.datasalt.pangool.utils.test.CollectionInputFormat
Override and return the list of Key/Value pairs to be served by this input format.
datum() - Method in class com.datasalt.pangool.io.DatumWrapper
 
datum(T) - Method in class com.datasalt.pangool.io.DatumWrapper
 
DatumWrapper<T> - Class in com.datasalt.pangool.io
 
DatumWrapper(T) - Constructor for class com.datasalt.pangool.io.DatumWrapper
 
DatumWrapper(T, T) - Constructor for class com.datasalt.pangool.io.DatumWrapper
 
DatumWrapper() - Constructor for class com.datasalt.pangool.io.DatumWrapper
 
deepCopy() - Method in class com.datasalt.pangool.io.Tuple
Performs a deep copy of this Tuple.
deepCopy(Map<String, FieldClonator>) - Method in class com.datasalt.pangool.io.Tuple
Performs a deep copy of this Tuple.
deepCopy(ITuple) - Static method in class com.datasalt.pangool.io.Tuple
Performs a deep copy of the given Tuple.
deepCopy(ITuple, Map<String, FieldClonator>) - Static method in class com.datasalt.pangool.io.Tuple
Performs a deep copy of the given Tuple.
DEFAULT_DEFLATE_LEVEL - Static variable in class com.datasalt.pangool.tuplemr.avro.AvroOutputFormat
The default deflate level.
DEFAULT_DEFLATE_LEVEL - Static variable in class org.apache.avro.mapreduce.lib.output.AvroOutputFormat
The default deflate level.
DEFAULT_HDFS_TMP_FOLDER_CONF_VALUE - Static variable in class com.datasalt.pangool.utils.InstancesDistributor
 
DefaultTupleDocumentConverter - Class in com.datasalt.pangool.solr
This default TupleDocumentConverter maps a ITuple to a SolrInputDocument using primitive Schema.Field types.
DefaultTupleDocumentConverter() - Constructor for class com.datasalt.pangool.solr.DefaultTupleDocumentConverter
 
DEFLATE_LEVEL_KEY - Static variable in class com.datasalt.pangool.tuplemr.avro.AvroOutputFormat
The configuration key for Avro deflate level.
DEFLATE_LEVEL_KEY - Static variable in class org.apache.avro.mapreduce.lib.output.AvroOutputFormat
The configuration key for Avro deflate level.
delegate - Variable in class com.datasalt.pangool.tuplemr.mapred.lib.input.DelegatingMapper
 
DelegatingInputFormat<K,V> - Class in com.datasalt.pangool.tuplemr.mapred.lib.input
An InputFormat that delegates behavior of paths to multiple other InputFormats.
DelegatingInputFormat() - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.input.DelegatingInputFormat
 
DelegatingMapper - Class in com.datasalt.pangool.tuplemr.mapred.lib.input
An Mapper that delegates behavior of paths to multiple other mappers.
DelegatingMapper() - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.input.DelegatingMapper
 
DelegatingRecordReader<K,V> - Class in com.datasalt.pangool.tuplemr.mapred.lib.input
This is a delegating RecordReader, which delegates the functionality to the underlying record reader in TaggedInputSplit
DelegatingRecordReader(InputSplit, TaskAttemptContext) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.input.DelegatingRecordReader
Constructs the DelegatingRecordReader.
deleteIfExists(FileSystem, Path) - Static method in class com.datasalt.pangool.utils.HadoopUtils
 
deser(DataInput) - Method in class com.datasalt.pangool.io.BitField
Deserialize a BitField serialized using BitField.ser(java.io.DataOutput).
deser(byte[], int) - Method in class com.datasalt.pangool.io.BitField
Deserialize a BitField serialized from a byte array.
deser(Object, InputStream) - Method in class com.datasalt.pangool.serialization.HadoopSerialization
Deseerializes into the given object using the Hadoop serialization system.
deser(Class, InputStream) - Method in class com.datasalt.pangool.serialization.HadoopSerialization
Return a new instance of the given class with the deserialized data from the input stream.
deser(Object, byte[], int, int) - Method in class com.datasalt.pangool.serialization.HadoopSerialization
Deserialize an object using Hadoop serialization from a byte array.
deserialize(T) - Method in class com.datasalt.pangool.serialization.ProtoStuffSerialization.ProtoStuffDeserializer
 
deserialize(TBase) - Method in class com.datasalt.pangool.serialization.ThriftSerialization.TDeserializerAdapter
 
deserialize(T) - Method in class com.datasalt.pangool.tuplemr.serialization.AvroFieldSerialization.AvroFieldDeserializer
 
deserialize(ITuple) - Method in class com.datasalt.pangool.tuplemr.serialization.SimpleTupleDeserializer
 
deserialize(DatumWrapper<ITuple>) - Method in class com.datasalt.pangool.tuplemr.serialization.TupleDeserializer
 
disableSchemaValidation(Configuration) - Static method in class com.datasalt.pangool.tuplemr.serialization.TupleSerialization
see TupleSerialization.CONF_SCHEMA_VALIDATION
disableSerialization(Configuration) - Static method in class com.datasalt.pangool.tuplemr.serialization.TupleSerialization
Use this method to disable this serialization in Hadoop
distribute(Object, String, Configuration) - Static method in class com.datasalt.pangool.utils.InstancesDistributor
Utility method for serializing an object and saving it in a way that later can be recovered anywhere in the cluster.
driver(String[]) - Method in class com.datasalt.pangool.PangoolDriver
This is a driver for the example programs.
dumpOutput(String) - Method in class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary
Dumps to string the given output

E

enableProtoStuffSerialization(Configuration) - Static method in class com.datasalt.pangool.serialization.ProtoStuffSerialization
Enables ProtoStuff Serialization support in Hadoop.
enableSchemaValidation(Configuration) - Static method in class com.datasalt.pangool.tuplemr.serialization.TupleSerialization
see TupleSerialization.CONF_SCHEMA_VALIDATION
enableSerialization(Configuration) - Static method in class com.datasalt.pangool.tuplemr.serialization.TupleSerialization
Use this method to enable this serialization in Hadoop
enableThriftSerialization(Configuration) - Static method in class com.datasalt.pangool.serialization.ThriftSerialization
Enables Thrift Serialization support in Hadoop.
ensureOutput(String) - Method in class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary
 
ensureSpace(int) - Method in class com.datasalt.pangool.io.BitField
Ensures a minimum size for the backing byte array
equals(Object) - Method in class com.datasalt.pangool.io.BytesWritable
Are the two byte sequences equal?
equals(Object) - Method in class com.datasalt.pangool.io.Schema
 
equals(Object) - Method in class com.datasalt.pangool.io.Schema.Field
 
equals(Object) - Method in class com.datasalt.pangool.io.Tuple
 
equals(Object) - Method in class com.datasalt.pangool.tuplemr.Criteria
 
equals(Object) - Method in class com.datasalt.pangool.tuplemr.Criteria.SortElement
 
equals(Object) - Method in class com.datasalt.pangool.tuplemr.TupleMRConfig
 
EXT - Static variable in class com.datasalt.pangool.tuplemr.avro.AvroOutputFormat
The file name extension for avro data files.
EXT - Static variable in class org.apache.avro.mapreduce.lib.output.AvroOutputFormat
The file name extension for avro data files.

F

failIfEmpty(Collection<?>, String) - Static method in exception com.datasalt.pangool.tuplemr.TupleMRException
 
failIfEmpty(Object[], String) - Static method in exception com.datasalt.pangool.tuplemr.TupleMRException
 
failIfNotNull(Object, String) - Static method in exception com.datasalt.pangool.tuplemr.TupleMRException
 
failIfNull(Object, String) - Static method in exception com.datasalt.pangool.tuplemr.TupleMRException
 
FieldClonator - Interface in com.datasalt.pangool.io
Used by Tuple.deepCopy(ITuple, java.util.Map) to allow deep copy of objects that Pangool don't know how to copy.
Fields - Class in com.datasalt.pangool.io
 
Fields() - Constructor for class com.datasalt.pangool.io.Fields
 
FileSplit - Class in com.datasalt.pangool.tuplemr.mapred.lib.input
A section of an input file.
FileSplit(Path, long, long, String[]) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.input.FileSplit
Constructs a split with host information
fileToString(FileSystem, Path) - Static method in class com.datasalt.pangool.utils.HadoopUtils
Reads the content of a file into a String.
firstMapOutput(String) - Method in class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary
 
firstMapperMultiOutput(String, String) - Method in class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary
 
firstReducerMultiOutput(String, String) - Method in class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary
 
firstReducerOutput(String) - Method in class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary
 
FixedWidthCSVTokenizer - Class in com.datasalt.pangool.tuplemr.mapred.lib.input
A custom CSVTokenizer for input lines with fixed width fields.
FixedWidthCSVTokenizer(int[], String) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.input.FixedWidthCSVTokenizer
Fixed width tokeninzer fields: must contains pairs of positions, indicating the location of fields in the line.
fS - Variable in class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary
 

G

get(int) - Method in interface com.datasalt.pangool.io.ITuple
 
get(String) - Method in interface com.datasalt.pangool.io.ITuple
 
get(int) - Method in class com.datasalt.pangool.io.Tuple
 
get(String) - Method in class com.datasalt.pangool.io.Tuple
 
get(int) - Method in class com.datasalt.pangool.io.ViewTuple
 
get(String) - Method in class com.datasalt.pangool.io.ViewTuple
 
get(Configuration) - Static method in class com.datasalt.pangool.tuplemr.TupleMRConfig
 
get(Configuration, JobID) - Static method in class com.datasalt.pangool.utils.JobContextFactory
 
get(Configuration, TaskAttemptID) - Static method in class com.datasalt.pangool.utils.TaskAttemptContextFactory
 
getAbreviation() - Method in enum com.datasalt.pangool.tuplemr.Criteria.Order
 
getAliases() - Method in class com.datasalt.pangool.tuplemr.Aliases
 
getAllowedConfigDirectories() - Static method in class com.datasalt.pangool.solr.SolrRecordWriter
Return the list of directories names that may be included in the configuration data passed to the tasks.
getBackingArray() - Method in class com.datasalt.pangool.io.BitField
Returns the backing array keeping the bits.
getBaseDir() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.ProxyOutputFormat.ProxyOutputCommitter
 
getBatchWriteException() - Method in class com.datasalt.pangool.solr.BatchWriter
 
getBoolean(int) - Method in interface com.datasalt.pangool.io.ITuple
 
getBoolean(String) - Method in interface com.datasalt.pangool.io.ITuple
 
getBoolean(int) - Method in class com.datasalt.pangool.io.Tuple
 
getBoolean(String) - Method in class com.datasalt.pangool.io.Tuple
 
getBoolean(int) - Method in class com.datasalt.pangool.io.ViewTuple
 
getBoolean(String) - Method in class com.datasalt.pangool.io.ViewTuple
 
getBytes() - Method in class com.datasalt.pangool.io.BytesWritable
Get the data from the BytesWritable.
getBytes() - Method in class com.datasalt.pangool.utils.Buffer
 
getCapacity() - Method in class com.datasalt.pangool.io.BytesWritable
Get the capacity, which is the maximum size that could handled without resizing the backing storage.
getCapacity() - Method in class com.datasalt.pangool.utils.Buffer
Get the capacity, which is the maximum size that could handled without resizing the backing storage.
getCommonCriteria() - Method in class com.datasalt.pangool.tuplemr.TupleMRConfig
Returns the criteria used to sort fields that are common among the intermediate schemas.
getCommonSchema() - Method in class com.datasalt.pangool.tuplemr.SerializationInfo
Returns the schema that contains fields that will be hadoopSer/deserialized before the schemaId.
getCommonSchemaDeserializers() - Method in class com.datasalt.pangool.tuplemr.SerializationInfo
 
getCommonSchemaIndexTranslation(int) - Method in class com.datasalt.pangool.tuplemr.SerializationInfo
Given a intermediate schema id, returns an index correlation from common schema indexes to the specified intermediate schema indexes.
getCommonSchemaSerializers() - Method in class com.datasalt.pangool.tuplemr.SerializationInfo
 
getCompressionCodec() - Method in class com.datasalt.pangool.io.TupleFile.Reader
Returns the compression codec of data in this file.
getCompressionCodec() - Method in class com.datasalt.pangool.io.TupleFile.Writer
Returns the compression codec of data in this file.
getConf() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.HadoopInputFormat
 
getConf() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TaggedInputSplit
 
getConf() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.HadoopOutputFormat
 
getConf() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.ProxyOutputFormat
 
getConf() - Method in class com.datasalt.pangool.tuplemr.mapred.SerializerComparator
 
getConf() - Method in class com.datasalt.pangool.tuplemr.mapred.SortComparator
 
getConf() - Method in class com.datasalt.pangool.tuplemr.mapred.TupleHashPartitioner
 
getConf() - Method in class com.datasalt.pangool.tuplemr.serialization.TupleFieldSerialization
 
getConf() - Method in class com.datasalt.pangool.tuplemr.serialization.TupleSerialization
 
getConf() - Method in class com.datasalt.pangool.tuplemr.TupleMRBuilder
 
getConf() - Method in class com.datasalt.pangool.utils.AvroBinaryComparator
 
getConf() - Method in class com.datasalt.pangool.utils.test.AbstractBaseTest
 
getConfig() - Method in class com.datasalt.pangool.tuplemr.mapred.SortComparator
 
getCountersEnabled(JobContext) - Static method in class com.datasalt.pangool.tuplemr.mapred.lib.output.PangoolMultipleOutputs
Returns if the counters for the named outputs are enabled or not.
getCurrentKey() - Method in class com.datasalt.pangool.tuplemr.avro.AvroRecordReader
 
getCurrentKey() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.DelegatingRecordReader
 
getCurrentKey() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleFileRecordReader
 
getCurrentKey() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat.TupleTextInputReader
 
getCurrentKey() - Method in class org.apache.avro.mapreduce.lib.input.AvroRecordReader
 
getCurrentValue() - Method in class com.datasalt.pangool.tuplemr.avro.AvroRecordReader
 
getCurrentValue() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.DelegatingRecordReader
 
getCurrentValue() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleFileRecordReader
 
getCurrentValue() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat.TupleTextInputReader
 
getCurrentValue() - Method in class org.apache.avro.mapreduce.lib.input.AvroRecordReader
 
getCustomComparator() - Method in class com.datasalt.pangool.tuplemr.Criteria.SortElement
 
getCustomPartitionFields() - Method in class com.datasalt.pangool.tuplemr.TupleMRConfig
Returns the custom fields used to partition tuples.
getDefaultValue() - Method in class com.datasalt.pangool.io.Schema.Field
 
getDeserializer(Class<Schema>) - Method in class com.datasalt.pangool.serialization.ProtoStuffSerialization
 
getDeserializer(Class<TBase>) - Method in class com.datasalt.pangool.serialization.ThriftSerialization
 
getDeserializer(Class<T>) - Method in class com.datasalt.pangool.tuplemr.serialization.AvroFieldSerialization
 
getDeserializer(Class<ITuple>) - Method in class com.datasalt.pangool.tuplemr.serialization.TupleFieldSerialization
 
getDeserializer(Class<DatumWrapper<ITuple>>) - Method in class com.datasalt.pangool.tuplemr.serialization.TupleSerialization
 
getDeserializer() - Static method in class com.datasalt.pangool.utils.ThriftUtils
Return a Thrift deserializer.
getDeserializers(Schema, Schema, Configuration) - Static method in class com.datasalt.pangool.tuplemr.SerializationInfo
 
getDouble(int) - Method in interface com.datasalt.pangool.io.ITuple
 
getDouble(String) - Method in interface com.datasalt.pangool.io.ITuple
 
getDouble(int) - Method in class com.datasalt.pangool.io.Tuple
 
getDouble(String) - Method in class com.datasalt.pangool.io.Tuple
 
getDouble(int) - Method in class com.datasalt.pangool.io.ViewTuple
 
getDouble(String) - Method in class com.datasalt.pangool.io.ViewTuple
 
getElements() - Method in class com.datasalt.pangool.tuplemr.Criteria
 
getElements() - Method in class com.datasalt.pangool.tuplemr.OrderBy
Returns a Criteria.SortElement object for every field added to this builder.
getEscapeCharacter() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat
 
getField(String) - Method in class com.datasalt.pangool.io.Schema
 
getField(int) - Method in class com.datasalt.pangool.io.Schema
 
getFieldAliases(String) - Method in class com.datasalt.pangool.tuplemr.TupleMRConfig
 
getFieldPos(String) - Method in class com.datasalt.pangool.io.Schema
 
getFieldPosUsingAliases(Schema, String, Map<String, String>) - Static method in class com.datasalt.pangool.io.Schema
 
getFields() - Method in class com.datasalt.pangool.io.Schema
 
getFieldsToPartition(int) - Method in class com.datasalt.pangool.tuplemr.SerializationInfo
Given a schema returns the fields (indexes) that will be used to calculate a partial hashing by TupleHashPartitioner
getFieldUsingAliases(Schema, String, Map<String, String>) - Static method in class com.datasalt.pangool.io.Schema
 
getFirst() - Method in class com.datasalt.pangool.utils.Pair
 
getFixedWidthFieldsPositions() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat
 
getFloat(int) - Method in interface com.datasalt.pangool.io.ITuple
 
getFloat(String) - Method in interface com.datasalt.pangool.io.ITuple
 
getFloat(int) - Method in class com.datasalt.pangool.io.Tuple
 
getFloat(String) - Method in class com.datasalt.pangool.io.Tuple
 
getFloat(int) - Method in class com.datasalt.pangool.io.ViewTuple
 
getFloat(String) - Method in class com.datasalt.pangool.io.ViewTuple
 
getGroupByFields() - Method in class com.datasalt.pangool.tuplemr.TupleMRConfig
Returns the fields that are common among all the intermediate schemas that will be used to group by the tuples emitted from the TupleMapper
getGroupSchema() - Method in class com.datasalt.pangool.tuplemr.SerializationInfo
Returns the schema containing the group-by fields ordered by the common sorting criteria.
getGroupSchemaDeserializers() - Method in class com.datasalt.pangool.tuplemr.SerializationInfo
 
getGroupSchemaIndexTranslation(int) - Method in class com.datasalt.pangool.tuplemr.SerializationInfo
Given a intermediate schema id, returns an index correlation from the group schema to the intermediate schema.
getGroupSchemaSerializers() - Method in class com.datasalt.pangool.tuplemr.SerializationInfo
 
getHadoopContext() - Method in class com.datasalt.pangool.tuplemr.TupleMapper.StaticTupleMRContext
Return the Hadoop MapContext.
getHadoopContext() - Method in class com.datasalt.pangool.tuplemr.TupleReducer.TupleMRContext
Returns the Hadoop ReduceContext.
getInput() - Method in class com.datasalt.pangool.tuplemr.serialization.SimpleTupleDeserializer
 
getInputFormatFile() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TaggedInputSplit
 
getInputProcessorFile() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TaggedInputSplit
 
getInputSplit() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TaggedInputSplit
Retrieves the original InputSplit.
getInteger(int) - Method in interface com.datasalt.pangool.io.ITuple
 
getInteger(String) - Method in interface com.datasalt.pangool.io.ITuple
 
getInteger(int) - Method in class com.datasalt.pangool.io.Tuple
 
getInteger(String) - Method in class com.datasalt.pangool.io.Tuple
 
getInteger(int) - Method in class com.datasalt.pangool.io.ViewTuple
 
getInteger(String) - Method in class com.datasalt.pangool.io.ViewTuple
 
getIntermediateSchema(String) - Method in class com.datasalt.pangool.tuplemr.TupleMRConfig
Returns a defined intermediate schema with the specified name
getIntermediateSchema(int) - Method in class com.datasalt.pangool.tuplemr.TupleMRConfig
Returns a defined intermediate schema with the specified schemaId.
The schemaId follows the order of schema definition in TupleMRConfig.addIntermediateSchema(Schema)
getIntermediateSchemaNames() - Method in class com.datasalt.pangool.tuplemr.TupleMRConfig
Returns a list with the names of all the intermediate schemas.
getIntermediateSchemas() - Method in class com.datasalt.pangool.tuplemr.TupleMRConfig
Returns all the intermediate schemas defined.
getKey() - Method in class com.datasalt.pangool.utils.test.CollectionInputFormat.Duple
 
getLength() - Method in class com.datasalt.pangool.io.BytesWritable
Get the current size of the buffer.
getLength() - Method in class com.datasalt.pangool.io.TupleFile.Writer
Returns the current length of the output file.
getLength() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.FileSplit
The number of bytes in the file to process.
getLength() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TaggedInputSplit
 
getLength() - Method in class com.datasalt.pangool.utils.Buffer
Get the current size of the buffer.
getLength() - Method in class com.datasalt.pangool.utils.test.CollectionInputFormat.CustomInputSplit
 
getLocations() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.FileSplit
 
getLocations() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TaggedInputSplit
 
getLocations() - Method in class com.datasalt.pangool.utils.test.CollectionInputFormat.CustomInputSplit
 
getLong(int) - Method in interface com.datasalt.pangool.io.ITuple
 
getLong(String) - Method in interface com.datasalt.pangool.io.ITuple
 
getLong(int) - Method in class com.datasalt.pangool.io.Tuple
 
getLong(String) - Method in class com.datasalt.pangool.io.Tuple
 
getLong(int) - Method in class com.datasalt.pangool.io.ViewTuple
 
getLong(String) - Method in class com.datasalt.pangool.io.ViewTuple
 
getMaxFieldSize() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.NullableCSVTokenizer
 
getMessage() - Method in exception com.datasalt.pangool.tuplemr.mapred.lib.input.InvalidInputException
Get a summary message of the problems found.
getMetadata() - Method in class com.datasalt.pangool.io.TupleFile.Reader
Returns the metadata object of the file
getMultiInputs() - Method in class com.datasalt.pangool.tuplemr.MultipleInputsInterface
 
getName() - Method in class com.datasalt.pangool.io.Schema.Field
 
getName() - Method in class com.datasalt.pangool.io.Schema
 
getName() - Method in class com.datasalt.pangool.tuplemr.Criteria.SortElement
 
getNamedOutput(String) - Method in class com.datasalt.pangool.tuplemr.MultipleOutputsCollector
 
getNamedOutputs() - Method in class com.datasalt.pangool.tuplemr.NamedOutputsInterface
 
getNullableFieldsIdx() - Method in class com.datasalt.pangool.io.Schema
 
getNullablePositionFromIndex(int) - Method in class com.datasalt.pangool.io.Schema
Return the position on the array returned by Schema.getNullableFieldsIdx() for a given field's index.
getNullOrder() - Method in class com.datasalt.pangool.tuplemr.Criteria.SortElement
 
getNullString() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat
 
getNumIntermediateSchemas() - Method in class com.datasalt.pangool.tuplemr.TupleMRConfig
Returns the number of intermediate schemas defined
getObjectClass() - Method in class com.datasalt.pangool.io.Schema.Field
 
getObjectSerialization() - Method in class com.datasalt.pangool.io.Schema.Field
 
getOrder() - Method in class com.datasalt.pangool.tuplemr.Criteria.SortElement
 
getOut() - Method in class com.datasalt.pangool.tuplemr.serialization.SimpleTupleSerializer
 
getOutputCommitter(TaskAttemptContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.HadoopOutputFormat
 
getOutputCommitter(TaskAttemptContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.ProxyOutputFormat
 
getOutputFormat() - Method in class com.datasalt.pangool.pig.PangoolStoreFunc
 
getPangoolSchema() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.HCatTupleInputFormat
 
getPartition(DatumWrapper<ITuple>, NullWritable, int) - Method in class com.datasalt.pangool.tuplemr.mapred.TupleHashPartitioner
 
getPartitionFieldsIndexes() - Method in class com.datasalt.pangool.tuplemr.SerializationInfo
 
getPath() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.FileSplit
The file containing this split's data.
getPos() - Method in class com.datasalt.pangool.tuplemr.avro.AvroRecordReader
 
getPos() - Method in class org.apache.avro.mapreduce.lib.input.AvroRecordReader
 
getPosition() - Method in class com.datasalt.pangool.io.TupleFile.Reader
Return the current byte position in the input file.
getProblems() - Method in exception com.datasalt.pangool.tuplemr.mapred.lib.input.InvalidInputException
Get the complete list of the problems reported.
getProgress() - Method in class com.datasalt.pangool.solr.HeartBeater
 
getProgress() - Method in class com.datasalt.pangool.tuplemr.avro.AvroRecordReader
 
getProgress() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.DelegatingRecordReader
 
getProgress() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleFileRecordReader
Return the progress within the input split
getProgress() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat.TupleTextInputReader
 
getProgress() - Method in class org.apache.avro.mapreduce.lib.input.AvroRecordReader
 
getProp(String) - Method in class com.datasalt.pangool.io.Schema.Field
 
getProps() - Method in class com.datasalt.pangool.io.Schema.Field
 
getQuoteCharacter() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat
 
getRecordWriter(TaskAttemptContext) - Method in class com.datasalt.pangool.solr.TupleSolrOutputFormat
 
getRecordWriter(TaskAttemptContext) - Method in class com.datasalt.pangool.tuplemr.avro.AvroOutputFormat
 
getRecordWriter(TaskAttemptContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.HadoopOutputFormat
 
getRecordWriter(String) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.PangoolMultipleOutputs
 
getRecordWriter(TaskAttemptContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.ProxyOutputFormat
 
getRecordWriter(TaskAttemptContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.TupleOutputFormat
 
getRecordWriter(TaskAttemptContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.TupleTextOutputFormat
 
getRecordWriter(TaskAttemptContext) - Method in class org.apache.avro.mapreduce.lib.output.AvroOutputFormat
 
getRollupFrom() - Method in class com.datasalt.pangool.tuplemr.TupleMRConfig
Returns the field from which the rollup will be performed
getSchema() - Method in interface com.datasalt.pangool.io.ITuple
 
getSchema() - Method in class com.datasalt.pangool.io.Tuple
 
getSchema() - Method in class com.datasalt.pangool.io.TupleFile.Reader
Return the tuple's Schema in the file.
getSchema() - Method in class com.datasalt.pangool.io.ViewTuple
 
getSchema() - Method in class com.datasalt.pangool.tuplemr.avro.AvroOutputFormat
 
getSchema() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.HCatTupleInputFormat
 
getSchema() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat
 
getSchemaFieldAliases() - Method in class com.datasalt.pangool.tuplemr.TupleMRConfig
Returns a map that contains for every schema a list of field aliases.
getSchemaIdByName(String) - Method in class com.datasalt.pangool.tuplemr.TupleMRConfig
Returns the schemaId from the schema's name.
getSchemaOrder() - Method in class com.datasalt.pangool.tuplemr.OrderBy
Gets the schemaOrder if set.
getSchemaOrderIndex() - Method in class com.datasalt.pangool.tuplemr.OrderBy
Returns the position in the list where schemaOrder was added using OrderBy#addSchemaOrder(Order)
getSchemasOrder() - Method in class com.datasalt.pangool.tuplemr.TupleMRConfig
Returns the order that will be used to sort tuples with different schemas after being compared by commonOrder.
getSchemaValidation(Configuration) - Static method in class com.datasalt.pangool.tuplemr.serialization.TupleSerialization
see TupleSerialization.CONF_SCHEMA_VALIDATION
getSecond() - Method in class com.datasalt.pangool.utils.Pair
 
getSeparatorCharacter() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat
 
getSerializationInfo() - Method in class com.datasalt.pangool.tuplemr.TupleMRConfig
Returns the SerializationInfo instance related to this configuration.
getSerializer(Class<Schema>) - Method in class com.datasalt.pangool.serialization.ProtoStuffSerialization
 
getSerializer(Class<TBase>) - Method in class com.datasalt.pangool.serialization.ThriftSerialization
 
getSerializer(Class<T>) - Method in class com.datasalt.pangool.tuplemr.serialization.AvroFieldSerialization
 
getSerializer(Class<ITuple>) - Method in class com.datasalt.pangool.tuplemr.serialization.TupleFieldSerialization
 
getSerializer(Class<DatumWrapper<ITuple>>) - Method in class com.datasalt.pangool.tuplemr.serialization.TupleSerialization
 
getSerializer() - Static method in class com.datasalt.pangool.utils.ThriftUtils
Return a Thrift serializer.
getSerializers(Schema, Configuration) - Static method in class com.datasalt.pangool.tuplemr.SerializationInfo
 
getSpecificOrderBys() - Method in class com.datasalt.pangool.tuplemr.TupleMRConfig
Returns the order that will be used to sort tuples with different schemas after being compared by commonOrder and schemaOrder.
getSpecificSchema(int) - Method in class com.datasalt.pangool.tuplemr.SerializationInfo
Given a intermediate schema id it returns a subschema from that intermediate schema that contains fields that will be serialized after the schemaId.
getSpecificSchemaDeserializers() - Method in class com.datasalt.pangool.tuplemr.SerializationInfo
 
getSpecificSchemaIndexTranslation(int) - Method in class com.datasalt.pangool.tuplemr.SerializationInfo
Given a intermediate schema id, returns an index correlation from the specific schema to the intermediate schema.
getSpecificSchemas() - Method in class com.datasalt.pangool.tuplemr.SerializationInfo
Returns a list containing all the specific schemas ordered by schema id.
getSpecificSchemaSerializers() - Method in class com.datasalt.pangool.tuplemr.SerializationInfo
 
getSplits(JobContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.DelegatingInputFormat
 
getSplits(JobContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.HadoopInputFormat
 
getSplits(JobContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.HCatTupleInputFormat
 
getSplits(JobContext) - Method in class com.datasalt.pangool.utils.test.CollectionInputFormat
 
getStart() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.FileSplit
The position of the first byte in the file to process.
getString(int) - Method in interface com.datasalt.pangool.io.ITuple
 
getString(String) - Method in interface com.datasalt.pangool.io.ITuple
 
getString(int) - Method in class com.datasalt.pangool.io.Tuple
 
getString(String) - Method in class com.datasalt.pangool.io.Tuple
 
getString(int) - Method in class com.datasalt.pangool.io.ViewTuple
 
getString(String) - Method in class com.datasalt.pangool.io.ViewTuple
 
getTupleMRConfig() - Method in class com.datasalt.pangool.tuplemr.TupleMapper.StaticTupleMRContext
 
getTupleMRConfig() - Method in class com.datasalt.pangool.tuplemr.TupleReducer.TupleMRContext
 
getType() - Method in class com.datasalt.pangool.io.Schema.Field
 
getType() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat
 
getValue() - Method in class com.datasalt.pangool.utils.test.CollectionInputFormat.Duple
 
giveMeACopy(Object) - Method in interface com.datasalt.pangool.io.FieldClonator
Must return a new copy of the given value.
GroupComparator - Class in com.datasalt.pangool.tuplemr.mapred
Tuple-based MapRed jobs binary group comparator.
GroupComparator() - Constructor for class com.datasalt.pangool.tuplemr.mapred.GroupComparator
 

H

HadoopInputFormat - Class in com.datasalt.pangool.tuplemr.mapred.lib.input
 
HadoopInputFormat(Class<? extends InputFormat>) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.input.HadoopInputFormat
 
HadoopOutputFormat - Class in com.datasalt.pangool.tuplemr.mapred.lib.output
 
HadoopOutputFormat(Class<? extends OutputFormat>) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.output.HadoopOutputFormat
 
hadoopSer - Variable in class com.datasalt.pangool.tuplemr.mapred.SerializerComparator
 
HadoopSerialization - Class in com.datasalt.pangool.serialization
You can use this utility class to serialize / deserialize anything in the Hadoop context.
HadoopSerialization(Configuration) - Constructor for class com.datasalt.pangool.serialization.HadoopSerialization
 
HadoopUtils - Class in com.datasalt.pangool.utils
Put here usefull methods for manipulating things in the HDFS, etc.
HadoopUtils() - Constructor for class com.datasalt.pangool.utils.HadoopUtils
 
hashBytes(byte[], int, int) - Static method in class com.datasalt.pangool.tuplemr.mapred.TupleHashPartitioner
 
hashCode() - Method in class com.datasalt.pangool.io.BytesWritable
 
hashCode() - Method in class com.datasalt.pangool.io.Schema.Field
 
hashCode() - Method in class com.datasalt.pangool.io.Schema
 
hashCode() - Method in class com.datasalt.pangool.io.Tuple
 
hashCode() - Method in class com.datasalt.pangool.tuplemr.Criteria
 
hashCode() - Method in class com.datasalt.pangool.tuplemr.Criteria.SortElement
 
hashCode() - Method in class com.datasalt.pangool.tuplemr.TupleMRConfig
 
hasNext() - Method in class com.datasalt.pangool.tuplemr.mapred.TupleIterator
 
HCatTupleInputFormat - Class in com.datasalt.pangool.tuplemr.mapred.lib.input
A bridge between HCatalog and Pangool that makes any HCatInputFormat compatible with Pangool.
HCatTupleInputFormat(String, String, Configuration) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.input.HCatTupleInputFormat
 
HDFS_TMP_FOLDER_CONF - Static variable in class com.datasalt.pangool.utils.InstancesDistributor
 
HeartBeater - Class in com.datasalt.pangool.solr
This class runs a background thread that once ever 5 seconds checks to see if a progress report is needed.
HeartBeater(Progressable) - Constructor for class com.datasalt.pangool.solr.HeartBeater
Create the heart beat object thread set it to daemon priority and start the thread.

I

IdentityTupleMapper - Class in com.datasalt.pangool.tuplemr
 
IdentityTupleMapper() - Constructor for class com.datasalt.pangool.tuplemr.IdentityTupleMapper
 
IdentityTupleReducer - Class in com.datasalt.pangool.tuplemr
 
IdentityTupleReducer() - Constructor for class com.datasalt.pangool.tuplemr.IdentityTupleReducer
 
incCounter(TaskInputOutputContext, String, Enum) - Static method in class com.datasalt.pangool.utils.HadoopUtils
Utility for doing ctx.getCounter(groupName, counter.toString()).increment(1);
incrementCounter(TaskID, String, String, long) - Static method in class com.datasalt.pangool.solr.SolrRecordWriter
 
init(FileReader<T>, FileSplit) - Method in class com.datasalt.pangool.tuplemr.avro.AvroRecordReader
 
init(FileReader<T>, FileSplit) - Method in class org.apache.avro.mapreduce.lib.input.AvroRecordReader
 
initHadoop() - Method in class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary
 
initialize(InputSplit, TaskAttemptContext) - Method in class com.datasalt.pangool.tuplemr.avro.AvroRecordReader
 
initialize(FileSplit, Configuration) - Method in class com.datasalt.pangool.tuplemr.avro.AvroRecordReader
 
initialize(InputSplit, TaskAttemptContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.DelegatingRecordReader
 
initialize(InputSplit, TaskAttemptContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleFileRecordReader
 
initialize(InputSplit, TaskAttemptContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat.TupleTextInputReader
 
initialize(InputSplit, TaskAttemptContext) - Method in class org.apache.avro.mapreduce.lib.input.AvroRecordReader
 
initialize(FileSplit, Configuration) - Method in class org.apache.avro.mapreduce.lib.input.AvroRecordReader
 
initializeComparators(Configuration, TupleMRConfig) - Static method in class com.datasalt.pangool.tuplemr.TupleMRConfigBuilder
Initializes the custom comparator instances inside the given config criterias, calling the Configurable.setConf(Configuration) method.
inputs - Variable in class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary
 
InstancesDistributor - Class in com.datasalt.pangool.utils
This class contains useful methods for serializing/deserializing instances that implement Serializable.
InstancesDistributor() - Constructor for class com.datasalt.pangool.utils.InstancesDistributor
 
InvalidInputException - Exception in com.datasalt.pangool.tuplemr.mapred.lib.input
This class wraps a list of problems with the input, so that the user can get a list of problems together instead of finding and fixing them one by one.
InvalidInputException(List<IOException>) - Constructor for exception com.datasalt.pangool.tuplemr.mapred.lib.input.InvalidInputException
Create the exception with the given list.
invertMap(Map<?, ?>) - Static method in class com.datasalt.pangool.utils.CommonUtils
 
isBlockCompressed() - Method in class com.datasalt.pangool.io.TupleFile.Reader
Returns true if records are block-compressed.
isClosing() - Method in class com.datasalt.pangool.solr.SolrRecordWriter
 
isCompressed() - Method in class com.datasalt.pangool.io.TupleFile.Reader
Returns true if values are compressed.
isHasHeader() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat
 
isMultipleSources - Variable in class com.datasalt.pangool.tuplemr.mapred.SortComparator
 
isNullable() - Method in class com.datasalt.pangool.io.Schema.Field
 
isRequiredConfigDirectory(String) - Static method in class com.datasalt.pangool.solr.SolrRecordWriter
check if the passed in directory is required to be present in the configuration data set.
isSet(int) - Method in class com.datasalt.pangool.io.BitField
Returns the value of a given bit.
isSplitable(JobContext, Path) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat
 
isStrictQuotes() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat
 
iterator() - Method in class com.datasalt.pangool.tuplemr.mapred.TupleIterator
 
ITuple - Interface in com.datasalt.pangool.io
This is the common interface implemented by tuples.

J

JobContextFactory - Class in com.datasalt.pangool.utils
A trick that uses reflection to be compatible with both Hadoop 1.0 and Hadoop 2.0 In Hadoop 2.0 JobContext is an interface and cannot be instantiated.
JobContextFactory() - Constructor for class com.datasalt.pangool.utils.JobContextFactory
 
join(String[], String) - Static method in class com.datasalt.pangool.utils.Strings
 
join(Collection<String>, String) - Static method in class com.datasalt.pangool.utils.Strings
 
jointSchema(Schema, Schema) - Static method in class com.datasalt.pangool.io.Mutator
Creates a joint schema between two Schemas.
jointSchema(String, Schema, Schema) - Static method in class com.datasalt.pangool.io.Mutator
Creates a joint schema between two Schemas.

L

listStatus(JobContext) - Method in class com.datasalt.pangool.tuplemr.avro.AvroInputFormat
 
listStatus(JobContext) - Method in class org.apache.avro.mapreduce.lib.input.AvroInputFormat
 
loadInstance(Configuration, Class<T>, String, boolean) - Static method in class com.datasalt.pangool.utils.InstancesDistributor
Given a Hadoop Configuration property and an Class, this method can re-instantiate an Object instance that was previously distributed using * InstancesDistributor.distribute(Object, String, Configuration).
LOG - Static variable in class com.datasalt.pangool.solr.HeartBeater
 

M

map(ITuple, NullWritable, TupleMapper<ITuple, NullWritable>.TupleMRContext, TupleMapper.Collector) - Method in class com.datasalt.pangool.tuplemr.IdentityTupleMapper
 
map(I1, I2, Mapper<I1, I2, O1, O2>.Context, MultipleOutputsCollector) - Method in class com.datasalt.pangool.tuplemr.mapred.MapOnlyMapper
 
map(I1, I2, Mapper<I1, I2, O1, O2>.Context) - Method in class com.datasalt.pangool.tuplemr.mapred.MapOnlyMapper
 
map(INPUT_KEY, INPUT_VALUE, TupleMapper<INPUT_KEY, INPUT_VALUE>.TupleMRContext, TupleMapper.Collector) - Method in class com.datasalt.pangool.tuplemr.TupleMapper
Called once per each input pair of key/values.
map(INPUT_KEY, INPUT_VALUE, Mapper<INPUT_KEY, INPUT_VALUE, DatumWrapper<ITuple>, NullWritable>.Context) - Method in class com.datasalt.pangool.tuplemr.TupleMapper
Do not override! Override TupleMapper.map(Object, Object, TupleMRContext, Collector) instead.
MapOnlyJobBuilder - Class in com.datasalt.pangool.tuplemr
The MapOnlyJobBuilder is a simple Pangool primitive that executes map-only Jobs.
MapOnlyJobBuilder(Configuration) - Constructor for class com.datasalt.pangool.tuplemr.MapOnlyJobBuilder
 
MapOnlyJobBuilder(Configuration, String) - Constructor for class com.datasalt.pangool.tuplemr.MapOnlyJobBuilder
 
MapOnlyMapper<I1,I2,O1,O2> - Class in com.datasalt.pangool.tuplemr.mapred
Mapper to be implemented by Map-only jobs.
MapOnlyMapper() - Constructor for class com.datasalt.pangool.tuplemr.mapred.MapOnlyMapper
 
METADATA_BYTES_AS_OBJECT - Static variable in class com.datasalt.pangool.io.Schema.Field
 
METADATA_DEFAULT_VALUE - Static variable in class com.datasalt.pangool.io.Schema.Field
 
METADATA_OBJECT_CLASS - Static variable in class com.datasalt.pangool.io.Schema.Field
 
METADATA_OBJECT_SERIALIZATION - Static variable in class com.datasalt.pangool.io.Schema.Field
 
minusFields(Schema, String...) - Static method in class com.datasalt.pangool.io.Mutator
Creates a new schema which has exactly the same fields as the input Schema minus the field names that are specified as "minusFields".
minusFields(String, Schema, String...) - Static method in class com.datasalt.pangool.io.Mutator
Creates a new schema which has exactly the same fields as the input Schema minus the field names that are specified as "minusFields".
MultipleInputsInterface - Class in com.datasalt.pangool.tuplemr
This class encapsulates the functionality of a builder such as TupleMRBuilder that provides Multiple Inputs.
MultipleInputsInterface(Configuration) - Constructor for class com.datasalt.pangool.tuplemr.MultipleInputsInterface
 
MultipleInputsInterface.Input - Class in com.datasalt.pangool.tuplemr
 
multipleOutputs - Variable in class com.datasalt.pangool.tuplemr.MultipleOutputsCollector
 
MultipleOutputsCollector - Class in com.datasalt.pangool.tuplemr
 
MultipleOutputsCollector(MapContext) - Constructor for class com.datasalt.pangool.tuplemr.MultipleOutputsCollector
 
MultipleOutputsCollector(ReduceContext) - Constructor for class com.datasalt.pangool.tuplemr.MultipleOutputsCollector
 
Mutator - Class in com.datasalt.pangool.io
Miscellaneous utilities for mutating Pangool schemas.
Mutator() - Constructor for class com.datasalt.pangool.io.Mutator
 

N

NamedOutputsInterface - Class in com.datasalt.pangool.tuplemr
This class encapsulates the functionality of a builder such as TupleMRBuilder that provides Named Outputs.
NamedOutputsInterface(Configuration) - Constructor for class com.datasalt.pangool.tuplemr.NamedOutputsInterface
 
NamedOutputsInterface.Output - Class in com.datasalt.pangool.tuplemr
 
needHeartBeat() - Method in class com.datasalt.pangool.solr.HeartBeater
inform the background thread that heartbeats are to be issued.
needsTaskCommit(TaskAttemptContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.ProxyOutputFormat.ProxyOutputCommitter
 
next(int) - Method in interface com.datasalt.pangool.io.NextByte
 
next(ITuple) - Method in class com.datasalt.pangool.io.TupleFile.Reader
Read the next ITuple in the file into tuple.
next() - Method in class com.datasalt.pangool.tuplemr.mapred.TupleIterator
 
NextByte - Interface in com.datasalt.pangool.io
Copyright [2012] [Datasalt Systems S.L.] Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License.
nextKeyValue() - Method in class com.datasalt.pangool.tuplemr.avro.AvroRecordReader
 
nextKeyValue() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.DelegatingRecordReader
 
nextKeyValue() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleFileRecordReader
 
nextKeyValue() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat.TupleTextInputReader
 
nextKeyValue() - Method in class org.apache.avro.mapreduce.lib.input.AvroRecordReader
 
NO_ESCAPE_CHARACTER - Static variable in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat
 
NO_ESCAPE_CHARACTER - Static variable in class com.datasalt.pangool.tuplemr.mapred.lib.output.TupleTextOutputFormat
 
NO_NULL_STRING - Static variable in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat
 
NO_QUOTE_CHARACTER - Static variable in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat
 
NO_QUOTE_CHARACTER - Static variable in class com.datasalt.pangool.tuplemr.mapred.lib.output.TupleTextOutputFormat
 
NO_SEPARATOR_CHARACTER - Static variable in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat
 
NONE - Static variable in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat.FieldSelector
 
NULLABLE_CHAR - Static variable in class com.datasalt.pangool.io.Fields
 
NullableCSVTokenizer - Class in com.datasalt.pangool.tuplemr.mapred.lib.input
A custom CSVTokenizer policy that detects and provides null values if needed.
NullableCSVTokenizer(char, boolean, String) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.input.NullableCSVTokenizer
Escaping wasn't handled by the provided JSCV tokenizer so we add it here.
nullCompare(Object, Object, Criteria.SortElement) - Method in class com.datasalt.pangool.tuplemr.mapred.SortComparator
 
nulls - Variable in class com.datasalt.pangool.tuplemr.mapred.SortComparator
 

O

offsets - Variable in class com.datasalt.pangool.tuplemr.mapred.SortComparator
 
onCloseGroup(int, String, ITuple, TupleReducer.TupleMRContext, TupleReducer<OUTPUT_KEY, OUTPUT_VALUE>.Collector) - Method in class com.datasalt.pangool.tuplemr.TupleRollupReducer
This is the method called after every sub-group is being closed when rollup is used.
onOpenGroup(int, String, ITuple, TupleReducer.TupleMRContext, TupleReducer<OUTPUT_KEY, OUTPUT_VALUE>.Collector) - Method in class com.datasalt.pangool.tuplemr.TupleRollupReducer
This is the method called any time that a sub-group is opened when rollup is used.
onTuple(ITuple) - Method in class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary.PrintVisitor
 
onTuple(ITuple) - Method in class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary.TupleVisitor
 
open(InputStream) - Method in class com.datasalt.pangool.serialization.ProtoStuffSerialization.ProtoStuffDeserializer
 
open(OutputStream) - Method in class com.datasalt.pangool.serialization.ProtoStuffSerialization.ProtoStuffSerializer
 
open(InputStream) - Method in class com.datasalt.pangool.serialization.ThriftSerialization.TDeserializerAdapter
 
open(OutputStream) - Method in class com.datasalt.pangool.serialization.ThriftSerialization.TSerializerAdapter
 
open(InputStream) - Method in class com.datasalt.pangool.tuplemr.serialization.AvroFieldSerialization.AvroFieldDeserializer
 
open(OutputStream) - Method in class com.datasalt.pangool.tuplemr.serialization.AvroFieldSerialization.AvroFieldSerializer
 
open(InputStream) - Method in class com.datasalt.pangool.tuplemr.serialization.SimpleTupleDeserializer
 
open(OutputStream) - Method in class com.datasalt.pangool.tuplemr.serialization.SimpleTupleSerializer
 
open(InputStream) - Method in class com.datasalt.pangool.tuplemr.serialization.TupleDeserializer
 
open(OutputStream) - Method in class com.datasalt.pangool.tuplemr.serialization.TupleSerializer
 
OrderBy - Class in com.datasalt.pangool.tuplemr
OrderBy is a convenience builder used by TupleMRConfig , similar to Criteria.
OrderBy(List<Criteria.SortElement>) - Constructor for class com.datasalt.pangool.tuplemr.OrderBy
 
OrderBy() - Constructor for class com.datasalt.pangool.tuplemr.OrderBy
 
org.apache.avro.mapreduce.lib.input - package org.apache.avro.mapreduce.lib.input
 
org.apache.avro.mapreduce.lib.output - package org.apache.avro.mapreduce.lib.output
 
originalDir - Variable in class com.datasalt.pangool.tuplemr.mapred.lib.output.ProxyOutputFormat
 
outputFormat - Variable in class com.datasalt.pangool.tuplemr.mapred.lib.output.ProxyOutputFormat
 
outputs - Variable in class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary
 

P

Pair<T1,T2> - Class in com.datasalt.pangool.utils
 
Pair(T1, T2) - Constructor for class com.datasalt.pangool.utils.Pair
 
PANGOOL_INPUT_DIR_FORMATS_PREFIX_CONF - Static variable in class com.datasalt.pangool.tuplemr.mapred.lib.input.PangoolMultipleInputs
 
PANGOOL_INPUT_DIR_MAPPERS_PREFIX_CONF - Static variable in class com.datasalt.pangool.tuplemr.mapred.lib.input.PangoolMultipleInputs
 
PangoolDriver - Class in com.datasalt.pangool
Copied from Hadoop's ProgramDriver to avoid clashing conflicts between 2.0 and 1.0.
PangoolDriver() - Constructor for class com.datasalt.pangool.PangoolDriver
 
PangoolMultipleInputs - Class in com.datasalt.pangool.tuplemr.mapred.lib.input
This class supports MapReduce jobs that have multiple input paths with a different InputFormat and Mapper for each path.
PangoolMultipleInputs() - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.input.PangoolMultipleInputs
 
PangoolMultipleOutputs<KEYOUT,VALUEOUT> - Class in com.datasalt.pangool.tuplemr.mapred.lib.output
This class is inspired by the MultipleOutputs class of Hadoop.
PangoolMultipleOutputs(TaskInputOutputContext<?, ?, KEYOUT, VALUEOUT>) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.output.PangoolMultipleOutputs
Creates and initializes multiple outputs support, it should be instantiated in the Mapper/Reducer setup method.
PangoolMultipleOutputs.InvalidNamedOutputException - Exception in com.datasalt.pangool.tuplemr.mapred.lib.output
Exception that is thrown when someone tries to access an invalid named output.
PangoolMultipleOutputs.InvalidNamedOutputException(String) - Constructor for exception com.datasalt.pangool.tuplemr.mapred.lib.output.PangoolMultipleOutputs.InvalidNamedOutputException
 
PangoolRuntimeException - Exception in com.datasalt.pangool
 
PangoolRuntimeException(Throwable) - Constructor for exception com.datasalt.pangool.PangoolRuntimeException
 
PangoolRuntimeException(String) - Constructor for exception com.datasalt.pangool.PangoolRuntimeException
 
PangoolRuntimeException(String, Throwable) - Constructor for exception com.datasalt.pangool.PangoolRuntimeException
 
PangoolStoreFunc - Class in com.datasalt.pangool.pig
A Pig's StoreFunc that can be used to save the result of a Pig flow into a Pangool-friendly format.
PangoolStoreFunc(String...) - Constructor for class com.datasalt.pangool.pig.PangoolStoreFunc
First argument is schema name, the others are the column names.
parse(String) - Static method in class com.datasalt.pangool.io.Fields
Parses schemas represented comma separated lists of: (field_name):(field_type)
Available types are: int long boolean float double string Class name (Any class that already supports Hadoop serialization)
Example of schema: name:string,age:int,weight:float
Additionally, fields can handle null values.
parse(String) - Static method in class com.datasalt.pangool.io.Schema
 
parse(JsonNode) - Static method in class com.datasalt.pangool.io.Schema
 
parse(JsonNode) - Static method in class com.datasalt.pangool.tuplemr.Aliases
 
parse(JsonNode) - Static method in class com.datasalt.pangool.tuplemr.Criteria
 
parse(String) - Static method in class com.datasalt.pangool.tuplemr.OrderBy
Parse in the form "field1:asc, field2:desc,...,fieldn:asc|null_smallest"
Examples:
"f1,f2,f3"
"f1:asc,f2:desc,f3"
"f1,f2:asc|null_smallest,f3:desc|null_biggest"
parse(String) - Static method in class com.datasalt.pangool.tuplemr.TupleMRConfig
Parse a schema from the provided string.
partialHashCode(ITuple, int[]) - Method in class com.datasalt.pangool.tuplemr.mapred.TupleHashPartitioner
Calculates a combinated hashCode using the specified number of fields.
prepare() - Method in class com.datasalt.pangool.utils.test.AbstractBaseTest
 
prepareToWrite(RecordWriter) - Method in class com.datasalt.pangool.pig.PangoolStoreFunc
 
previousDatum() - Method in class com.datasalt.pangool.io.DatumWrapper
 
process(Logger, String) - Static method in class com.datasalt.pangool.solr.SolrRecordWriter
 
process(Logger, String) - Static method in class com.datasalt.pangool.solr.SolrRecordWriter
 
processLine(String) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.TupleTextOutputFormat.CustomCSVWriter
 
ProtoStuffSerialization - Class in com.datasalt.pangool.serialization
 
ProtoStuffSerialization() - Constructor for class com.datasalt.pangool.serialization.ProtoStuffSerialization
 
ProtoStuffSerialization.ProtoStuffDeserializer<T extends com.dyuproject.protostuff.Schema> - Class in com.datasalt.pangool.serialization
 
ProtoStuffSerialization.ProtoStuffDeserializer(Class<T>) - Constructor for class com.datasalt.pangool.serialization.ProtoStuffSerialization.ProtoStuffDeserializer
 
ProtoStuffSerialization.ProtoStuffSerializer<T extends com.dyuproject.protostuff.Schema> - Class in com.datasalt.pangool.serialization
 
ProtoStuffSerialization.ProtoStuffSerializer() - Constructor for class com.datasalt.pangool.serialization.ProtoStuffSerialization.ProtoStuffSerializer
 
PROXIED_OUTPUT_FORMAT_CONF - Static variable in class com.datasalt.pangool.tuplemr.mapred.lib.output.ProxyOutputFormat
 
ProxyOutputFormat - Class in com.datasalt.pangool.tuplemr.mapred.lib.output
This special implementation of FileOutputFormat is used as a proxy for being able to support any type of OutputFormat at the same time that we support Multiple Output Formats (also with any type of OutputFormat).
ProxyOutputFormat() - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.output.ProxyOutputFormat
 
ProxyOutputFormat.ProxyOutputCommitter - Class in com.datasalt.pangool.tuplemr.mapred.lib.output
 
ProxyOutputFormat.ProxyOutputCommitter(Path, TaskAttemptContext, OutputCommitter) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.output.ProxyOutputFormat.ProxyOutputCommitter
 
ProxyOutputFormat.ProxyOutputCommitter(Path, TaskAttemptContext) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.output.ProxyOutputFormat.ProxyOutputCommitter
 
putNext(Tuple) - Method in class com.datasalt.pangool.pig.PangoolStoreFunc
 

Q

queueBatch(Collection<SolrInputDocument>) - Method in class com.datasalt.pangool.solr.BatchWriter
 

R

randomChar() - Static method in class com.datasalt.pangool.utils.TestUtils
 
randomString(int) - Static method in class com.datasalt.pangool.utils.TestUtils
 
readBytes(DataInputStream, ITuple, int) - Method in class com.datasalt.pangool.tuplemr.serialization.SimpleTupleDeserializer
 
readCustomObject(DataInputStream, ITuple, Class<?>, int, Deserializer) - Method in class com.datasalt.pangool.tuplemr.serialization.SimpleTupleDeserializer
 
readEnum(DataInputStream, ITuple, Class<?>, int) - Method in class com.datasalt.pangool.tuplemr.serialization.SimpleTupleDeserializer
 
readFields(DataInput) - Method in class com.datasalt.pangool.io.BytesWritable
 
readFields(DataInput) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.FileSplit
 
readFields(DataInput) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TaggedInputSplit
 
readFields(ITuple, Deserializer[]) - Method in class com.datasalt.pangool.tuplemr.serialization.SimpleTupleDeserializer
Read fields using the specified "readSchema" in the constructor.
readFields(DataInput) - Method in class com.datasalt.pangool.utils.test.CollectionInputFormat.CustomInputSplit
 
readIntDoubleMap(Path, FileSystem) - Static method in class com.datasalt.pangool.utils.HadoopUtils
Reads maps of integer -> double
readIntDoubleMapFromGlob(Path, FileSystem) - Static method in class com.datasalt.pangool.utils.HadoopUtils
Reads maps of integer -> double from glob paths like "folder/part-r*"
readIntIntMap(Path, FileSystem) - Static method in class com.datasalt.pangool.utils.HadoopUtils
Reads maps of integer -> integer
readIntIntMapFromGlob(Path, FileSystem) - Static method in class com.datasalt.pangool.utils.HadoopUtils
Reads maps of integer -> integer from glob paths like "folder/part-r*"
readTuples(Path, Configuration, AbstractHadoopTestLibrary.TupleVisitor) - Static method in class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary
 
readUtf8(DataInputStream, ITuple, int) - Method in class com.datasalt.pangool.tuplemr.serialization.SimpleTupleDeserializer
 
reduce(ITuple, Iterable<ITuple>, TupleReducer.TupleMRContext, TupleReducer<ITuple, NullWritable>.Collector) - Method in class com.datasalt.pangool.tuplemr.IdentityTupleReducer
 
reduce(DatumWrapper<ITuple>, Iterable<NullWritable>, Reducer<DatumWrapper<ITuple>, NullWritable, OUTPUT_KEY, OUTPUT_VALUE>.Context) - Method in class com.datasalt.pangool.tuplemr.mapred.RollupReducer
 
reduce(DatumWrapper<ITuple>, Iterable<NullWritable>, Reducer<DatumWrapper<ITuple>, NullWritable, DatumWrapper<ITuple>, NullWritable>.Context) - Method in class com.datasalt.pangool.tuplemr.mapred.SimpleCombiner
 
reduce(DatumWrapper<ITuple>, Iterable<NullWritable>, Reducer<DatumWrapper<ITuple>, NullWritable, OUTPUT_KEY, OUTPUT_VALUE>.Context) - Method in class com.datasalt.pangool.tuplemr.mapred.SimpleReducer
 
reduce(ITuple, Iterable<ITuple>, TupleReducer.TupleMRContext, TupleReducer<OUTPUT_KEY, OUTPUT_VALUE>.Collector) - Method in class com.datasalt.pangool.tuplemr.TupleReducer
This method is called with an iterable that contains all the tuples that have been grouped by the fields defined in TupleMRConfigBuilder.setGroupByFields(String...)
remove() - Method in class com.datasalt.pangool.tuplemr.mapred.TupleIterator
 
removeFromCache(Configuration, String) - Static method in class com.datasalt.pangool.utils.InstancesDistributor
Delete a file that has been distributed using InstancesDistributor.distribute(Object, String, Configuration).
replace(byte[], int) - Method in class com.datasalt.pangool.io.BytesWritable
Replaces the internal byte array by the given one.
requiredConfigDirectories - Static variable in class com.datasalt.pangool.solr.SolrRecordWriter
 
RESERVED_KEYWORDS - Static variable in class com.datasalt.pangool.io.Schema.Field
 
RollupReducer<OUTPUT_KEY,OUTPUT_VALUE> - Class in com.datasalt.pangool.tuplemr.mapred
This Reducer implements a similar functionality than SimpleReducer but adding a Rollup feature.
RollupReducer() - Constructor for class com.datasalt.pangool.tuplemr.mapred.RollupReducer
 
run() - Method in class com.datasalt.pangool.solr.HeartBeater
 
run(Mapper.Context) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.DelegatingMapper
 
run(Reducer<DatumWrapper<ITuple>, NullWritable, OUTPUT_KEY, OUTPUT_VALUE>.Context) - Method in class com.datasalt.pangool.tuplemr.mapred.RollupReducer
 
runUpdate(List<SolrInputDocument>) - Method in class com.datasalt.pangool.solr.BatchWriter
 

S

Schema - Class in com.datasalt.pangool.io
A list of Schema.Field elements that a ITuple instance contains.
Schema(String, List<Schema.Field>) - Constructor for class com.datasalt.pangool.io.Schema
 
Schema.Field - Class in com.datasalt.pangool.io
A field is an abstract data type that can be one of this: A 32-bit signed int; A 64-bit signed long; A 32-bit IEEE single-float; or A 64-bit IEEE double-float; or A unicode string; A boolean; or An enum, containing one of a small set of symbols; A byte buffer An arbitrary object, serializable by Hadoop's serialization

A field can be constructed using one of its static createXXX methods.

Schema.Field.FieldConfigurable - Interface in com.datasalt.pangool.io
Interface that allows to receive ITuple field's metadata.
Schema.Field.Type - Enum in com.datasalt.pangool.io
 
Schema.SchemaParseException - Exception in com.datasalt.pangool.io
 
Schema.SchemaParseException(Throwable) - Constructor for exception com.datasalt.pangool.io.Schema.SchemaParseException
 
Schema.SchemaParseException(String) - Constructor for exception com.datasalt.pangool.io.Schema.SchemaParseException
 
seek(long) - Method in class com.datasalt.pangool.io.TupleFile.Reader
Set the current byte position in the input file.
select(int) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat.FieldSelector
 
ser(DataOutput) - Method in class com.datasalt.pangool.io.BitField
Serializes the bit field to the data output.
ser(Object, OutputStream) - Method in class com.datasalt.pangool.serialization.HadoopSerialization
Serializes the given object using the Hadoop serialization system.
SerializationInfo - Class in com.datasalt.pangool.tuplemr
Contains information about how to perform binary internal serialization and comparison.
SerializationInfo(TupleMRConfig) - Constructor for class com.datasalt.pangool.tuplemr.SerializationInfo
 
serialize(T) - Method in class com.datasalt.pangool.serialization.ProtoStuffSerialization.ProtoStuffSerializer
 
serialize(TBase) - Method in class com.datasalt.pangool.serialization.ThriftSerialization.TSerializerAdapter
 
serialize(T) - Method in class com.datasalt.pangool.tuplemr.serialization.AvroFieldSerialization.AvroFieldSerializer
 
serialize(ITuple) - Method in class com.datasalt.pangool.tuplemr.serialization.SimpleTupleSerializer
 
serialize(DatumWrapper<ITuple>) - Method in class com.datasalt.pangool.tuplemr.serialization.TupleSerializer
 
SerializerComparator - Class in com.datasalt.pangool.tuplemr.mapred
A simple RawComparator and Serializable that compares in binary.
SerializerComparator() - Constructor for class com.datasalt.pangool.tuplemr.mapred.SerializerComparator
 
serializerComparator - Variable in class com.datasalt.pangool.tuplemr.mapred.SortComparator
 
serInfo - Variable in class com.datasalt.pangool.tuplemr.mapred.SortComparator
 
set(int, boolean) - Method in class com.datasalt.pangool.io.BitField
Sets or unsets a bit.
set(int) - Method in class com.datasalt.pangool.io.BitField
Sets a bit.
set(BytesWritable) - Method in class com.datasalt.pangool.io.BytesWritable
Set the BytesWritable to the contents of the given newData.
set(byte[], int, int) - Method in class com.datasalt.pangool.io.BytesWritable
Set the value to a copy of the given byte range
set(int, Object) - Method in interface com.datasalt.pangool.io.ITuple
 
set(String, Object) - Method in interface com.datasalt.pangool.io.ITuple
 
set(int, Object) - Method in class com.datasalt.pangool.io.Tuple
 
set(String, Object) - Method in class com.datasalt.pangool.io.Tuple
 
set(int, Object) - Method in class com.datasalt.pangool.io.ViewTuple
 
set(String, Object) - Method in class com.datasalt.pangool.io.ViewTuple
 
set(TupleMRConfig, Configuration) - Static method in class com.datasalt.pangool.tuplemr.TupleMRConfig
Returns the instance files generated.
setBatchWriteException(Exception) - Method in class com.datasalt.pangool.solr.BatchWriter
 
setCapacity(int) - Method in class com.datasalt.pangool.io.BytesWritable
Change the capacity of the backing storage.
setCapacity(int) - Method in class com.datasalt.pangool.utils.Buffer
Change the capacity of the backing storage.
setClosing(boolean) - Method in class com.datasalt.pangool.solr.SolrRecordWriter
 
setConf(Configuration) - Method in class com.datasalt.pangool.tuplemr.mapred.GroupComparator
 
setConf(Configuration) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.HadoopInputFormat
 
setConf(Configuration) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TaggedInputSplit
 
setConf(Configuration) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.HadoopOutputFormat
 
setConf(Configuration) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.ProxyOutputFormat
 
setConf(Configuration) - Method in class com.datasalt.pangool.tuplemr.mapred.SerializerComparator
 
setConf(Configuration) - Method in class com.datasalt.pangool.tuplemr.mapred.SortComparator
 
setConf(Configuration) - Method in class com.datasalt.pangool.tuplemr.mapred.TupleHashPartitioner
 
setConf(Configuration) - Method in class com.datasalt.pangool.tuplemr.serialization.TupleFieldSerialization
 
setConf(Configuration) - Method in class com.datasalt.pangool.tuplemr.serialization.TupleSerialization
 
setConf(Configuration) - Method in class com.datasalt.pangool.utils.AvroBinaryComparator
 
setContained(ITuple, int[]) - Method in class com.datasalt.pangool.io.ViewTuple
 
setContained(ITuple) - Method in class com.datasalt.pangool.io.ViewTuple
 
setCountersEnabled(Job, boolean) - Static method in class com.datasalt.pangool.tuplemr.mapred.lib.output.PangoolMultipleOutputs
Enables or disables counters for the named outputs.
setCustomComparator(RawComparator<?>) - Method in class com.datasalt.pangool.tuplemr.Criteria.SortElement
 
setCustomPartitionFields(String...) - Method in class com.datasalt.pangool.tuplemr.TupleMRConfigBuilder
Sets the fields used to partition the tuples emmited by TupleMapper .
setDefaultNamedOutput(OutputFormat, Class, Class) - Method in class com.datasalt.pangool.tuplemr.MapOnlyJobBuilder
Sets the default named output specs.
setDefaultNamedOutput(OutputFormat, Class, Class, Map<String, String>) - Method in class com.datasalt.pangool.tuplemr.MapOnlyJobBuilder
Sets the default named output specs.
setDefaultNamedOutput(Schema) - Method in class com.datasalt.pangool.tuplemr.MapOnlyJobBuilder
Sets the default named output (Tuple format) specs.
setDefaultNamedOutput(Job, OutputFormat, Class<?>, Class<?>) - Static method in class com.datasalt.pangool.tuplemr.mapred.lib.output.PangoolMultipleOutputs
Adds a the specs of the default named output for the job (any named output which is not explicitly defined).
setDefaultNamedOutput(OutputFormat, Class, Class) - Method in class com.datasalt.pangool.tuplemr.TupleMRBuilder
Sets the default named output specs.
setDefaultNamedOutput(OutputFormat, Class, Class, Map<String, String>) - Method in class com.datasalt.pangool.tuplemr.TupleMRBuilder
Sets the default named output specs.
setDefaultNamedOutput(Schema) - Method in class com.datasalt.pangool.tuplemr.TupleMRBuilder
Sets the default named output (Tuple format) specs.
setDeflateLevel(Job, int) - Static method in class com.datasalt.pangool.tuplemr.avro.AvroOutputFormat
Enable output compression using the deflate codec and specify its level.
setDeflateLevel(Job, int) - Static method in class org.apache.avro.mapreduce.lib.output.AvroOutputFormat
Enable output compression using the deflate codec and specify its level.
setFieldAliases(String, Aliases) - Method in class com.datasalt.pangool.tuplemr.TupleMRConfigBuilder
Permits to set aliases, or alternate names,to fields that belong to intermediate schema's.
setFieldProperties(Map<String, String>, Map<String, String>) - Method in interface com.datasalt.pangool.io.Schema.Field.FieldConfigurable
Sets the properties for this field.
setFieldProperties(Map<String, String>, Map<String, String>) - Method in class com.datasalt.pangool.tuplemr.serialization.AvroFieldSerialization
 
setFieldProperties(Map<String, String>, Map<String, String>) - Method in class com.datasalt.pangool.tuplemr.serialization.TupleFieldSerialization
 
setFirst(T1) - Method in class com.datasalt.pangool.utils.Pair
 
setGroupByFields(String...) - Method in class com.datasalt.pangool.tuplemr.TupleMRConfigBuilder
Defines the fields used to group tuples by.
setIterator(Iterator<NullWritable>) - Method in class com.datasalt.pangool.tuplemr.mapred.TupleIterator
 
setJarByClass(Class<?>) - Method in class com.datasalt.pangool.tuplemr.MapOnlyJobBuilder
 
setJarByClass(Class<?>) - Method in class com.datasalt.pangool.tuplemr.TupleMRBuilder
Sets the jar by class , as in Job.setJarByClass(Class)
setMapper(MapOnlyMapper) - Method in class com.datasalt.pangool.tuplemr.MapOnlyJobBuilder
Deprecated. 
setMaxFieldSize(int) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.NullableCSVTokenizer
 
setObjectSerialization(Class<? extends Serialization>) - Method in class com.datasalt.pangool.io.Schema.Field
Sets custom serialization for fields with type OBJECT.
setOrderBy(OrderBy) - Method in class com.datasalt.pangool.tuplemr.TupleMRConfigBuilder
Sets the criteria to sort the tuples by.
setOutput(Path, OutputFormat, Class<?>, Class<?>) - Method in class com.datasalt.pangool.tuplemr.MapOnlyJobBuilder
 
setOutput(Path, OutputFormat, Class<?>, Class<?>) - Method in class com.datasalt.pangool.tuplemr.TupleMRBuilder
 
setProgress(Progressable) - Method in class com.datasalt.pangool.solr.HeartBeater
 
setRollupFrom(String) - Method in class com.datasalt.pangool.tuplemr.TupleMRConfigBuilder
 
setSecond(T2) - Method in class com.datasalt.pangool.utils.Pair
 
setSerializations(Configuration) - Static method in class com.datasalt.pangool.tuplemr.mapred.lib.input.CascadingTupleInputFormat
Like in Cascading's TupleSerialization.setSerializations() but accepting a Hadoop's Configuration rather than JobConf.
setSize(int) - Method in class com.datasalt.pangool.io.BytesWritable
Change the size of the buffer.
setSize(int) - Method in class com.datasalt.pangool.utils.Buffer
Change the size of the buffer.
setSpecificInputContext(Configuration, String) - Static method in class com.datasalt.pangool.tuplemr.mapred.lib.input.PangoolMultipleInputs
Iterates over the Configuration and sets the specific context found for the input in the Job instance.
setSpecificNamedOutputContext(Configuration, Job, String) - Static method in class com.datasalt.pangool.tuplemr.mapred.lib.output.PangoolMultipleOutputs
Iterates over the Configuration and sets the specific context found for the namedOutput in the Job instance.
setSpecificOrderBy(String, OrderBy) - Method in class com.datasalt.pangool.tuplemr.TupleMRConfigBuilder
Sets how tuples from the specific schemaName will be sorted after being sorted by commonOrderBy and schemaOrder
setStatus(String) - Method in class com.datasalt.pangool.solr.HeartBeater
 
setStoreLocation(String, Job) - Method in class com.datasalt.pangool.pig.PangoolStoreFunc
 
setSyncInterval(Job, int) - Static method in class com.datasalt.pangool.tuplemr.avro.AvroOutputFormat
Set the sync interval to be used by the underlying DataFileWriter.
setSyncInterval(Job, int) - Static method in class org.apache.avro.mapreduce.lib.output.AvroOutputFormat
Set the sync interval to be used by the underlying DataFileWriter.
setTupleCombiner(TupleReducer) - Method in class com.datasalt.pangool.tuplemr.TupleMRBuilder
 
setTupleOutput(Path, Schema) - Method in class com.datasalt.pangool.tuplemr.MapOnlyJobBuilder
 
setTupleOutput(Path, Schema) - Method in class com.datasalt.pangool.tuplemr.TupleMRBuilder
 
setTupleReducer(TupleReducer) - Method in class com.datasalt.pangool.tuplemr.TupleMRBuilder
 
setup(Mapper<I1, I2, O1, O2>.Context, MultipleOutputsCollector) - Method in class com.datasalt.pangool.tuplemr.mapred.MapOnlyMapper
Override this method, not the other one.
setup(Mapper<I1, I2, O1, O2>.Context) - Method in class com.datasalt.pangool.tuplemr.mapred.MapOnlyMapper
Deprecated. This method shouldn't be implemented, otherwise MultipleOutputs may not work. When implemented, super.setup() should be called if MultipleOutputs are to be used. Therefore we have created another signature with the MultipleOutputsCollector in order to avoid this, which will be the official setup() method in the future (this one will be declared final in further versions).
setup(Reducer<DatumWrapper<ITuple>, NullWritable, OUTPUT_KEY, OUTPUT_VALUE>.Context) - Method in class com.datasalt.pangool.tuplemr.mapred.RollupReducer
 
setup(Reducer<DatumWrapper<ITuple>, NullWritable, DatumWrapper<ITuple>, NullWritable>.Context) - Method in class com.datasalt.pangool.tuplemr.mapred.SimpleCombiner
 
setup(Reducer<DatumWrapper<ITuple>, NullWritable, OUTPUT_KEY, OUTPUT_VALUE>.Context) - Method in class com.datasalt.pangool.tuplemr.mapred.SimpleReducer
 
setup(TupleMapper<INPUT_KEY, INPUT_VALUE>.TupleMRContext, TupleMapper.Collector) - Method in class com.datasalt.pangool.tuplemr.TupleMapper
Called once at the start of the task.
setup(Mapper<INPUT_KEY, INPUT_VALUE, DatumWrapper<ITuple>, NullWritable>.Context) - Method in class com.datasalt.pangool.tuplemr.TupleMapper
Do not override.
setup(TupleReducer.TupleMRContext, TupleReducer<OUTPUT_KEY, OUTPUT_VALUE>.Collector) - Method in class com.datasalt.pangool.tuplemr.TupleReducer
 
setupJob(JobContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.ProxyOutputFormat.ProxyOutputCommitter
 
setupTask(TaskAttemptContext) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.ProxyOutputFormat.ProxyOutputCommitter
 
setValue(V) - Method in class com.datasalt.pangool.utils.test.CollectionInputFormat.Duple
 
shallowCopy(ITuple) - Method in class com.datasalt.pangool.io.Tuple
Simple shallow copy of this Tuple to another Tuple.
SimpleCombiner - Class in com.datasalt.pangool.tuplemr.mapred
 
SimpleCombiner() - Constructor for class com.datasalt.pangool.tuplemr.mapred.SimpleCombiner
 
SimpleReducer<OUTPUT_KEY,OUTPUT_VALUE> - Class in com.datasalt.pangool.tuplemr.mapred
This is a proxy Reducer implementation that delegates its funcionality to a TupleReducer instance configured by TupleMRBuilder.setTupleReducer(TupleReducer)
SimpleReducer() - Constructor for class com.datasalt.pangool.tuplemr.mapred.SimpleReducer
 
SimpleTupleDeserializer - Class in com.datasalt.pangool.tuplemr.serialization
This Deserializer holds all the baseline code for deserializing Tuples.
SimpleTupleDeserializer(Schema, HadoopSerialization, Configuration) - Constructor for class com.datasalt.pangool.tuplemr.serialization.SimpleTupleDeserializer
Constructor with one Schema.
SimpleTupleDeserializer(Schema, Schema, HadoopSerialization, Configuration) - Constructor for class com.datasalt.pangool.tuplemr.serialization.SimpleTupleDeserializer
Constructor with two schemas.
SimpleTupleSerializer - Class in com.datasalt.pangool.tuplemr.serialization
This Serializer holds all the baseline code for serializing Tuples.
SimpleTupleSerializer(HadoopSerialization) - Constructor for class com.datasalt.pangool.tuplemr.serialization.SimpleTupleSerializer
 
SimpleTupleSerializer(Schema, HadoopSerialization, Configuration) - Constructor for class com.datasalt.pangool.tuplemr.serialization.SimpleTupleSerializer
 
SimpleTupleSerializer.CustomObjectSerializationException - Exception in com.datasalt.pangool.tuplemr.serialization
Thrown when an unexpected exception happens when serializing a custom object.
SimpleTupleSerializer.CustomObjectSerializationException() - Constructor for exception com.datasalt.pangool.tuplemr.serialization.SimpleTupleSerializer.CustomObjectSerializationException
 
SimpleTupleSerializer.CustomObjectSerializationException(String) - Constructor for exception com.datasalt.pangool.tuplemr.serialization.SimpleTupleSerializer.CustomObjectSerializationException
 
SimpleTupleSerializer.CustomObjectSerializationException(String, Throwable) - Constructor for exception com.datasalt.pangool.tuplemr.serialization.SimpleTupleSerializer.CustomObjectSerializationException
 
SimpleTupleSerializer.CustomObjectSerializationException(Throwable) - Constructor for exception com.datasalt.pangool.tuplemr.serialization.SimpleTupleSerializer.CustomObjectSerializationException
 
SolrRecordWriter - Class in com.datasalt.pangool.solr
Instantiate a record writer that will build a Solr index.
SolrRecordWriter(int, boolean, int, int, String, String, TupleDocumentConverter, TaskAttemptContext) - Constructor for class com.datasalt.pangool.solr.SolrRecordWriter
 
SortComparator - Class in com.datasalt.pangool.tuplemr.mapred
Tuple-based MapRed job binary comparator.
SortComparator() - Constructor for class com.datasalt.pangool.tuplemr.mapred.SortComparator
 
Strings - Class in com.datasalt.pangool.utils
 
Strings() - Constructor for class com.datasalt.pangool.utils.Strings
 
stringToFile(FileSystem, Path, String) - Static method in class com.datasalt.pangool.utils.HadoopUtils
Creates a file with the given string, overwritting if needed.
subSetOf(Schema, String...) - Static method in class com.datasalt.pangool.io.Mutator
Creates a subset of the input Schema exactly with the fields whose names are specified.
subSetOf(String, Schema, String...) - Static method in class com.datasalt.pangool.io.Mutator
Creates a subset of the input Schema exactly with the fields whose names are specified.
superSetOf(Schema, Schema.Field...) - Static method in class com.datasalt.pangool.io.Mutator
Creates a superset of the input Schema, taking all the Fields in the input schema and adding some new ones.
superSetOf(String, Schema, Schema.Field...) - Static method in class com.datasalt.pangool.io.Mutator
Creates a superset of the input Schema, taking all the Fields in the input schema and adding some new ones.
swapInstances() - Method in class com.datasalt.pangool.io.DatumWrapper
 
sync(long) - Method in class com.datasalt.pangool.io.TupleFile.Reader
Seek to the next sync mark past a given position.
sync() - Method in class com.datasalt.pangool.io.TupleFile.Writer
create a sync point
SYNC_INTERVAL_KEY - Static variable in class com.datasalt.pangool.tuplemr.avro.AvroOutputFormat
The configuration key for Avro sync interval.
SYNC_INTERVAL_KEY - Static variable in class org.apache.avro.mapreduce.lib.output.AvroOutputFormat
The configuration key for Avro sync interval.
synchronize(FileSystem, Path, FileSystem, Path) - Static method in class com.datasalt.pangool.utils.HadoopUtils
 
syncSeen() - Method in class com.datasalt.pangool.io.TupleFile.Reader
Returns true iff the previous call to next passed a sync mark.

T

TaggedInputSplit - Class in com.datasalt.pangool.tuplemr.mapred.lib.input
An InputSplit that tags another InputSplit with extra data for use by DelegatingInputFormats and DelegatingMappers.
TaggedInputSplit() - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.input.TaggedInputSplit
 
TaggedInputSplit(InputSplit, Configuration, String, String) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.input.TaggedInputSplit
Creates a new TaggedInputSplit.
TaskAttemptContextFactory - Class in com.datasalt.pangool.utils
A trick that uses reflection to be compatible with both Hadoop 1.0 and Hadoop 2.0 In Hadoop 2.0 TaskAttemptContext is an interface and cannot be instantiated.
TaskAttemptContextFactory() - Constructor for class com.datasalt.pangool.utils.TaskAttemptContextFactory
 
TestUtils - Class in com.datasalt.pangool.utils
 
TestUtils() - Constructor for class com.datasalt.pangool.utils.TestUtils
 
threadLocalBuffer - Static variable in class com.datasalt.pangool.serialization.ProtoStuffSerialization.ProtoStuffSerializer
 
ThriftSerialization - Class in com.datasalt.pangool.serialization
A Serialization for types generated by Apache Thrift.
ThriftSerialization() - Constructor for class com.datasalt.pangool.serialization.ThriftSerialization
 
ThriftSerialization.TDeserializerAdapter - Class in com.datasalt.pangool.serialization
 
ThriftSerialization.TDeserializerAdapter(Class<? extends TBase>) - Constructor for class com.datasalt.pangool.serialization.ThriftSerialization.TDeserializerAdapter
 
ThriftSerialization.TSerializerAdapter - Class in com.datasalt.pangool.serialization
 
ThriftSerialization.TSerializerAdapter() - Constructor for class com.datasalt.pangool.serialization.ThriftSerialization.TSerializerAdapter
 
ThriftUtils - Class in com.datasalt.pangool.utils
 
ThriftUtils() - Constructor for class com.datasalt.pangool.utils.ThriftUtils
 
toAvroSchema(Schema) - Static method in class com.datasalt.pangool.utils.AvroUtils
Converts from one Pangool schema to one Avro schema for serializing it
toJson(JsonGenerator) - Method in class com.datasalt.pangool.io.Schema
 
toJson(JsonGenerator) - Method in class com.datasalt.pangool.tuplemr.Criteria
 
toJson(boolean) - Method in class com.datasalt.pangool.tuplemr.TupleMRConfig
 
toJSON(TBase) - Static method in class com.datasalt.pangool.utils.ThriftUtils
Serializes a Thrift object with the protocol TSimpleJSONProtocol.
tokenizeLine(String, CSVStrategy, BufferedReader) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.FixedWidthCSVTokenizer
 
tokenizeLine(String, CSVStrategy, BufferedReader) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.NullableCSVTokenizer
 
toPangoolSchema(Schema) - Static method in class com.datasalt.pangool.utils.AvroUtils
Converts from one Avro schema to one Pangool schema for de-serializing it
toRecord(ITuple, GenericData.Record) - Method in class com.datasalt.pangool.utils.TupleToAvroRecordConverter
Moves data between a Tuple and an Avro Record
toString() - Method in class com.datasalt.pangool.io.BitField
Returns the hexadecimal representation of the set backing array
toString() - Method in class com.datasalt.pangool.io.BytesWritable
Generate the stream of bytes as hex pairs separated by ' '.
toString() - Method in class com.datasalt.pangool.io.DatumWrapper
 
toString() - Method in class com.datasalt.pangool.io.Schema.Field
 
toString() - Method in class com.datasalt.pangool.io.Schema
 
toString(boolean) - Method in class com.datasalt.pangool.io.Schema
 
toString() - Method in class com.datasalt.pangool.io.Tuple
 
toString(ITuple) - Static method in class com.datasalt.pangool.io.Tuple
 
toString() - Method in class com.datasalt.pangool.io.TupleFile.Reader
Returns the name of the file.
toString() - Method in class com.datasalt.pangool.io.ViewTuple
 
toString() - Method in class com.datasalt.pangool.tuplemr.Criteria.SortElement
 
toString() - Method in class com.datasalt.pangool.tuplemr.Criteria
 
toString() - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.FileSplit
 
toString() - Method in class com.datasalt.pangool.tuplemr.OrderBy
 
toString() - Method in class com.datasalt.pangool.tuplemr.TupleMRConfig
 
trash(String...) - Method in class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary
 
Tuple - Class in com.datasalt.pangool.io
This is the basic implementation of ITuple.
Tuple(Schema) - Constructor for class com.datasalt.pangool.io.Tuple
 
Tuple.IDontKnowHowToCopyThisStuff - Exception in com.datasalt.pangool.io
Thrown by Tuple.deepCopy(ITuple, java.util.Map) in the case of field of a type that Pangool doesn't know how to copy it.
Tuple.IDontKnowHowToCopyThisStuff(String) - Constructor for exception com.datasalt.pangool.io.Tuple.IDontKnowHowToCopyThisStuff
 
TupleDeserializer - Class in com.datasalt.pangool.tuplemr.serialization
 
TupleDeserializer(HadoopSerialization, TupleMRConfig, Configuration) - Constructor for class com.datasalt.pangool.tuplemr.serialization.TupleDeserializer
 
TupleDocumentConverter - Interface in com.datasalt.pangool.solr
Implementations will map ITuple to SolrInputDocument.
TupleFieldSerialization - Class in com.datasalt.pangool.tuplemr.serialization
Serializes a Tuple withing a Pangool field.
TupleFieldSerialization() - Constructor for class com.datasalt.pangool.tuplemr.serialization.TupleFieldSerialization
 
TupleFile - Class in com.datasalt.pangool.io
Utilities class for reading and writing binary files with ITuple.
TupleFile() - Constructor for class com.datasalt.pangool.io.TupleFile
 
TupleFile.Reader - Class in com.datasalt.pangool.io
Reads SequenceFiles containing @{link ITuple} written using TupleFile.Writer.
TupleFile.Reader(FileSystem, Schema, Configuration, Path) - Constructor for class com.datasalt.pangool.io.TupleFile.Reader
Open the named file.
TupleFile.Reader(FileSystem, Configuration, Path) - Constructor for class com.datasalt.pangool.io.TupleFile.Reader
Open the named file.
TupleFile.Writer - Class in com.datasalt.pangool.io
Class for writing files containing ITuple.
TupleFile.Writer(FileSystem, Configuration, Path, Schema) - Constructor for class com.datasalt.pangool.io.TupleFile.Writer
Create the named file for storing @{link ITuple}s with the given schema.
TupleFile.Writer(FileSystem, Configuration, Path, Schema, Progressable, SequenceFile.Metadata) - Constructor for class com.datasalt.pangool.io.TupleFile.Writer
Create the named file with write-progress reporter for storing @{link ITuple}s with the given schema.
TupleFile.Writer(FileSystem, Configuration, Path, Schema, int, short, long, Progressable, SequenceFile.Metadata) - Constructor for class com.datasalt.pangool.io.TupleFile.Writer
Create the named file with write-progress reporter for storing @{link ITuple}s with the given schema.
TupleFile.Writer(FileSystem, Configuration, Path, Schema, int, short, long, SequenceFile.CompressionType, CompressionCodec, Progressable, SequenceFile.Metadata) - Constructor for class com.datasalt.pangool.io.TupleFile.Writer
Creates a TupleFile Writer.
TupleFile.Writer(Configuration, FSDataOutputStream, Schema, SequenceFile.CompressionType, CompressionCodec, SequenceFile.Metadata) - Constructor for class com.datasalt.pangool.io.TupleFile.Writer
Creates a TupleFile Writer.
TupleFile.Writer(FileSystem, Configuration, Path, Schema, SequenceFile.CompressionType, CompressionCodec, Progressable) - Constructor for class com.datasalt.pangool.io.TupleFile.Writer
Creates a TupleFile Writer.
TupleFile.Writer(FileSystem, Configuration, Path, Schema, SequenceFile.CompressionType, CompressionCodec, Progressable, SequenceFile.Metadata) - Constructor for class com.datasalt.pangool.io.TupleFile.Writer
Creates a TupleFile Writer.
TupleFileRecordReader - Class in com.datasalt.pangool.tuplemr.mapred.lib.input
An RecordReader for TupleFiles.
TupleFileRecordReader() - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleFileRecordReader
 
TupleFileRecordReader(Schema) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleFileRecordReader
If a schema is specified, it will be used as target schema, trying to preserve backwards compatibility always when possible.
TupleHashPartitioner - Class in com.datasalt.pangool.tuplemr.mapred
 
TupleHashPartitioner() - Constructor for class com.datasalt.pangool.tuplemr.mapred.TupleHashPartitioner
 
TupleInputFormat - Class in com.datasalt.pangool.tuplemr.mapred.lib.input
An InputFormat for reading TupleFiles containing ITuple
TupleInputFormat(Schema) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleInputFormat
 
TupleInputFormat() - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleInputFormat
 
TupleIterator<OUTPUT_KEY,OUTPUT_VALUE> - Class in com.datasalt.pangool.tuplemr.mapred
Iterator used in SimpleReducer and RollupReducer.
TupleIterator(ReduceContext<DatumWrapper<ITuple>, NullWritable, OUTPUT_KEY, OUTPUT_VALUE>) - Constructor for class com.datasalt.pangool.tuplemr.mapred.TupleIterator
 
TupleMapper<INPUT_KEY,INPUT_VALUE> - Class in com.datasalt.pangool.tuplemr
TupleMapper is the Tuple-based Hadoop's Mapper version.
TupleMapper() - Constructor for class com.datasalt.pangool.tuplemr.TupleMapper
 
TupleMapper.Collector - Class in com.datasalt.pangool.tuplemr
Class for collecting data inside a TupleMapper.
TupleMapper.StaticTupleMRContext<INPUT_KEY,INPUT_VALUE> - Class in com.datasalt.pangool.tuplemr
 
TupleMapper.TupleMRContext - Class in com.datasalt.pangool.tuplemr
 
TupleMRBuilder - Class in com.datasalt.pangool.tuplemr
TupleMRBuilder creates Tuple-based Map-Reduce jobs.
TupleMRBuilder(Configuration) - Constructor for class com.datasalt.pangool.tuplemr.TupleMRBuilder
 
TupleMRBuilder(Configuration, String) - Constructor for class com.datasalt.pangool.tuplemr.TupleMRBuilder
 
tupleMRConf - Variable in class com.datasalt.pangool.tuplemr.mapred.SortComparator
 
TupleMRConfig - Class in com.datasalt.pangool.tuplemr
TupleMRConfig contains the entire configuration parameters from a Tuple-based job.
TupleMRConfig() - Constructor for class com.datasalt.pangool.tuplemr.TupleMRConfig
 
TupleMRConfigBuilder - Class in com.datasalt.pangool.tuplemr
ConfigBuilder creates TupleMRConfig immutable instances.
TupleMRConfigBuilder() - Constructor for class com.datasalt.pangool.tuplemr.TupleMRConfigBuilder
 
TupleMRException - Exception in com.datasalt.pangool.tuplemr
 
TupleMRException(Throwable) - Constructor for exception com.datasalt.pangool.tuplemr.TupleMRException
 
TupleMRException(String, Throwable) - Constructor for exception com.datasalt.pangool.tuplemr.TupleMRException
 
TupleMRException(String) - Constructor for exception com.datasalt.pangool.tuplemr.TupleMRException
 
TupleOutputFormat - Class in com.datasalt.pangool.tuplemr.mapred.lib.output
An OutputFormat that writes ITuples.
TupleOutputFormat() - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.output.TupleOutputFormat
Empty constructor means the output Schema will be picked from the first Tuple that is emitted.
TupleOutputFormat(Schema) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.output.TupleOutputFormat
Providing output schema enables output validation.
TupleOutputFormat(String) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.output.TupleOutputFormat
Deprecated. 
TupleReducer<OUTPUT_KEY,OUTPUT_VALUE> - Class in com.datasalt.pangool.tuplemr
This is the common interface that any TupleMRBuilder job needs to implement.
TupleReducer() - Constructor for class com.datasalt.pangool.tuplemr.TupleReducer
 
TupleReducer.Collector - Class in com.datasalt.pangool.tuplemr
Base class for collecting data from a TupleReducer in the reduce phase.
TupleReducer.Collector(ReduceContext<DatumWrapper<ITuple>, NullWritable, Object, Object>) - Constructor for class com.datasalt.pangool.tuplemr.TupleReducer.Collector
 
TupleReducer.CombinerCollector - Class in com.datasalt.pangool.tuplemr
Class for collecting data from a TupleReducer in the combining phase
TupleReducer.CombinerCollector(ReduceContext<DatumWrapper<ITuple>, NullWritable, Object, Object>) - Constructor for class com.datasalt.pangool.tuplemr.TupleReducer.CombinerCollector
This non static inner class is created to eliminate the need in of the extended GroupHandler methods to specify the generic types for the Collector meanwhile keeping generics.
TupleReducer.StaticCollector<OUTPUT_KEY,OUTPUT_VALUE,CONTEXT_OUTPUT_KEY,CONTEXT_OUTPUT_VALUE> - Class in com.datasalt.pangool.tuplemr
A base class for the TupleReducer.Collector
TupleReducer.StaticCollector(ReduceContext<DatumWrapper<ITuple>, NullWritable, CONTEXT_OUTPUT_KEY, CONTEXT_OUTPUT_VALUE>) - Constructor for class com.datasalt.pangool.tuplemr.TupleReducer.StaticCollector
 
TupleReducer.TupleMRContext - Class in com.datasalt.pangool.tuplemr
 
TupleReducer.TupleMRContext(ReduceContext<DatumWrapper<ITuple>, NullWritable, Object, Object>, TupleMRConfig) - Constructor for class com.datasalt.pangool.tuplemr.TupleReducer.TupleMRContext
 
TupleRollupReducer<OUTPUT_KEY,OUTPUT_VALUE> - Class in com.datasalt.pangool.tuplemr
Similar to TupleReducer but added functionality to be used with rollup
TupleRollupReducer() - Constructor for class com.datasalt.pangool.tuplemr.TupleRollupReducer
 
TupleSerialization - Class in com.datasalt.pangool.tuplemr.serialization
A Serialization for DatumWrapper
TupleSerialization() - Constructor for class com.datasalt.pangool.tuplemr.serialization.TupleSerialization
 
TupleSerialization(HadoopSerialization, TupleMRConfig) - Constructor for class com.datasalt.pangool.tuplemr.serialization.TupleSerialization
 
TupleSerializer - Class in com.datasalt.pangool.tuplemr.serialization
 
TupleSerializer(HadoopSerialization, TupleMRConfig, boolean) - Constructor for class com.datasalt.pangool.tuplemr.serialization.TupleSerializer
 
TupleSolrOutputFormat - Class in com.datasalt.pangool.solr
Instantiable OutputFormat that can be used in Pangool for indexing ITuple in SOLR.
TupleSolrOutputFormat(File, Configuration) - Constructor for class com.datasalt.pangool.solr.TupleSolrOutputFormat
 
TupleSolrOutputFormat(File, Configuration, TupleDocumentConverter) - Constructor for class com.datasalt.pangool.solr.TupleSolrOutputFormat
 
TupleSolrOutputFormat(File, Configuration, TupleDocumentConverter, boolean, int, int, int) - Constructor for class com.datasalt.pangool.solr.TupleSolrOutputFormat
 
TupleTextInputFormat - Class in com.datasalt.pangool.tuplemr.mapred.lib.input
A special input format that supports reading text lines into ITuple.
TupleTextInputFormat(Schema, boolean, boolean, Character, Character, Character, TupleTextInputFormat.FieldSelector, String) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat
Character separated files reader.
TupleTextInputFormat(Schema, int[], boolean, String) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat
Fixed width fields file reader.
TupleTextInputFormat.FieldSelector - Class in com.datasalt.pangool.tuplemr.mapred.lib.input
When provided, will use it as a mapping between the text file columns and the provided Schema.
TupleTextInputFormat.FieldSelector(Integer...) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat.FieldSelector
 
TupleTextInputFormat.TupleTextInputReader - Class in com.datasalt.pangool.tuplemr.mapred.lib.input
 
TupleTextInputFormat.TupleTextInputReader(Schema, boolean, boolean, Character, Character, Character, TupleTextInputFormat.FieldSelector, String) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat.TupleTextInputReader
 
TupleTextInputFormat.TupleTextInputReader(Schema, int[], boolean, String) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.input.TupleTextInputFormat.TupleTextInputReader
 
TupleTextOutputFormat - Class in com.datasalt.pangool.tuplemr.mapred.lib.output
A special output format that supports converting a ITuple into text.
TupleTextOutputFormat(Schema, boolean, char, char, char) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.output.TupleTextOutputFormat
 
TupleTextOutputFormat(Schema, boolean, char, char, char, String) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.output.TupleTextOutputFormat
You must specify the Schema that will be used for Tuples being written and the CSV semantics (if any).
TupleTextOutputFormat.CustomCSVWriter - Class in com.datasalt.pangool.tuplemr.mapred.lib.output
We had to almost re-implement CSVWriter for properly supporting null strings.
TupleTextOutputFormat.CustomCSVWriter(Writer, char, char, char, String) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.output.TupleTextOutputFormat.CustomCSVWriter
 
TupleTextOutputFormat.TupleTextRecordWriter - Class in com.datasalt.pangool.tuplemr.mapred.lib.output
 
TupleTextOutputFormat.TupleTextRecordWriter(Schema, TupleTextOutputFormat.CustomCSVWriter) - Constructor for class com.datasalt.pangool.tuplemr.mapred.lib.output.TupleTextOutputFormat.TupleTextRecordWriter
 
TupleToAvroRecordConverter - Class in com.datasalt.pangool.utils
 
TupleToAvroRecordConverter(Schema, Configuration) - Constructor for class com.datasalt.pangool.utils.TupleToAvroRecordConverter
 

U

unset(int) - Method in class com.datasalt.pangool.io.BitField
Unsets a bit.
Utf8 - Class in com.datasalt.pangool.io
Main String type for Pangool.
Utf8() - Constructor for class com.datasalt.pangool.io.Utf8
 
Utf8(byte[]) - Constructor for class com.datasalt.pangool.io.Utf8
 
Utf8(String) - Constructor for class com.datasalt.pangool.io.Utf8
 
Utf8(Text) - Constructor for class com.datasalt.pangool.io.Utf8
 

V

validateOutputName(String) - Static method in class com.datasalt.pangool.tuplemr.mapred.lib.output.PangoolMultipleOutputs
Convenience method for validating output names externally.Will throw InvalidArgumentException if parameter name is not a valid output name according to this implementation.
valueAt(int, ITuple, int[]) - Method in class com.datasalt.pangool.tuplemr.serialization.SimpleTupleSerializer
 
valueOf(String) - Static method in enum com.datasalt.pangool.io.Schema.Field.Type
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum com.datasalt.pangool.tuplemr.Criteria.NullOrder
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum com.datasalt.pangool.tuplemr.Criteria.Order
Returns the enum constant of this type with the specified name.
values() - Static method in enum com.datasalt.pangool.io.Schema.Field.Type
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum com.datasalt.pangool.tuplemr.Criteria.NullOrder
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum com.datasalt.pangool.tuplemr.Criteria.Order
Returns an array containing the constants of this enum type, in the order they are declared.
ViewTuple - Class in com.datasalt.pangool.io
A ITuple with a contained one, but that creates a read only view over it and filtering only for some fields.
ViewTuple(Schema) - Constructor for class com.datasalt.pangool.io.ViewTuple
 
ViewTuple(Schema, int[]) - Constructor for class com.datasalt.pangool.io.ViewTuple
 

W

withInput(String, Object, Object) - Method in class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary
 
withInput(String, Object) - Method in class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary
 
withOutput(String, Object) - Method in class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary
 
withOutput(String, Object, Object) - Method in class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary
 
withTupleInput(String, ITuple) - Method in class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary
 
withTupleOutput(String, ITuple) - Method in class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary
 
writable(Object) - Method in class com.datasalt.pangool.utils.test.AbstractHadoopTestLibrary
 
write(DataOutput) - Method in class com.datasalt.pangool.io.BytesWritable
 
write(ITuple, NullWritable) - Method in class com.datasalt.pangool.solr.SolrRecordWriter
Write a record.
write(DataOutput) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.FileSplit
 
write(DataOutput) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.input.TaggedInputSplit
 
write(String, K, V) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.PangoolMultipleOutputs
Write key and value to the namedOutput.
write(String, K, V, String) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.PangoolMultipleOutputs
Write key and value to baseOutputPath using the namedOutput.
write(ITuple, NullWritable) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.TupleTextOutputFormat.TupleTextRecordWriter
 
write(String, K, V) - Method in class com.datasalt.pangool.tuplemr.MultipleOutputsCollector
 
write(ITuple) - Method in class com.datasalt.pangool.tuplemr.TupleMapper.Collector
 
write(OUTPUT_KEY, OUTPUT_VALUE) - Method in class com.datasalt.pangool.tuplemr.TupleReducer.CombinerCollector
Overrided write for wrapping tuples into DatumWrappers.
write(OUTPUT_KEY, OUTPUT_VALUE) - Method in class com.datasalt.pangool.tuplemr.TupleReducer.StaticCollector
 
write(DataOutput) - Method in class com.datasalt.pangool.utils.test.CollectionInputFormat.CustomInputSplit
 
writeNext(String[]) - Method in class com.datasalt.pangool.tuplemr.mapred.lib.output.TupleTextOutputFormat.CustomCSVWriter
 
writeTXT(String, File) - Static method in class com.datasalt.pangool.utils.CommonUtils
Writes the string into the file.

Z

ZIP_FILE_BASE_NAME - Static variable in class com.datasalt.pangool.solr.TupleSolrOutputFormat
The base name of the zip file containing the configuration information.
zipDirectory(Configuration, ZipOutputStream, String, String, Path) - Static method in class com.datasalt.pangool.solr.SolrRecordWriter
Write a file to a zip output stream, removing leading path name components from the actual file name when creating the zip file entry.

A B C D E F G H I J L M N O P Q R S T U V W Z

Copyright © –2014 Datasalt Systems S.L.. All rights reserved.