Transform CSV Column Values into Single Row
My data in CSV looks like this:

Actual Data (image)

and I want to transform it into:

Expected Data (image)

(hivetablename.hivecolumnname = dbtablename.dbtablecolumn)

that is, by joining the multiple row values into a single row value as shown above. Please note that 'AND' is a literal placed between the conditions being built; it appears after every condition except the last one, so the final record contributes only the bare condition (xx=yy).

I would like the solution in Scala with Spark. Many thanks in advance!
scala apache-spark-2.0
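For what it's worth, here is a minimal sketch of one way to do this in Spark 2.x. It assumes the CSV carries the four columns used in the attempt quoted below (hivetablename, hivetablecolumn, dbtablename, dbtablecolumn): concat builds each per-row condition, collect_list gathers all conditions into one array, and concat_ws places the " AND " literal only between elements, so nothing follows the last condition.

    import org.apache.spark.sql.functions._
    import spark.implicits._

    // Read the CSV with a header row; column names are assumed from the question.
    val df = spark.read.option("header", "true").csv("file:///conf.csv")

    // Build one "hive.col = db.col" condition per row.
    val conds = df.select(
      concat($"hivetablename", lit("."), $"hivetablecolumn",
             lit(" = "),
             $"dbtablename", lit("."), $"dbtablecolumn").as("cond"))

    // Collapse every row into a single value. concat_ws inserts the
    // separator only between elements, never after the last one.
    val joined = conds.agg(concat_ws(" AND ", collect_list($"cond")).as("combined"))

    joined.show(false)

Note that collect_list does not guarantee any particular row order; if the order of the conditions matters, carry a sortable key column through and sort on it first.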
asked Nov 9 at 13:54, edited Nov 9 at 14:28
Farhan Soomro
12
Could you please share what you have tried so far?
– ulubeyn
Nov 9 at 14:40
Many thanks for the response. I'm a learner, so I couldn't go much further. Here's what I've been able to achieve:

    val dfconf = spark.read.option("header", "true").csv("file:///conf.csv").alias("conf")
    dfconf.show(10, false)
    val newDF1 = dfconf.withColumn("Y", lit("1"))
    val newDF2 = newDF1.withColumn("Z", lit(","))
    val newDF3 = newDF2.withColumn("comb",
      concat($"hivetablename", lit("."), $"hivetablecolumn",
             lit("="),
             $"dbtablename", lit("."), $"dbtablecolumn"))
    newDF3.show()
    val newDF4 = newDF3.groupBy("Y").agg(collect_set("Z").as("combined"))
– Farhan Soomro
Nov 9 at 14:51
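Building on that attempt: the helper columns Y and Z are not needed, and collect_set("Z") only gathers the comma literal rather than the conditions. A sketch of how the comb column from the comment above could be collapsed instead:

    // Continuing from newDF3 in the comment above. collect_list keeps
    // duplicates (unlike collect_set), and concat_ws puts the " AND "
    // literal only between elements, so none trails the last condition.
    val combined = newDF3.agg(concat_ws(" AND ", collect_list($"comb")).as("combined"))
    combined.show(false)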