代写DTS002TC Essential of Big Data Coursework 2代做留学生Python程序

DTS002TC Essential of Big Data

Coursework 2 (Individual Assessment)

Due: 5:00 pm China time (UTC+8 Beijing) on Sat. 24th. May. 2025

Weight: 50%

Maximum score: 100 marks (100 % individual marks)

Assessed learning outcomes:

E. Demonstrate the ability to write codes to obtain numerical solutions to mathematical problems.

F. Demonstrate the ability to display computational results in tabulated or graphical forms. Develop an understanding of the industrial and commercial applications of big data.

Late policy: 5%of the total marks available for the assessment shall be deducted from the assessment mark for each working day after the submission date, up to a maximum of five working days.

Risks:

Please read the coursework instructions and requirements carefully. Not following these instructions and requirements may result in loss of marks.

Plagiarism results in award of ZERO mark.

The formal procedure for submitting coursework at XJTLU is strictly followed. Submission link on Learning Mall will be provided in due course.The submission time stamp on Learning Mall will be used to check late submission.

Overview

This coursework aims to provide students with hands-on experience in analyzing and predicting global electricity generation data using Python. Students will be required to perform. data reading, preprocessing, prediction, visualization, and validation of their predictions against real-world data for selected countries. This exercise will help students understand the practical applications of big data analytics in the electricity sector and enhance their skills in data manipulation, visualization, and predictive modeling.

Task 1: Data Processing and Analysis (40 marks)

1.1 Data Reading and Preprocessing (15 marks)

Using Python, perform. the following tasks:

a. Import the necessary libraries (e.g., pandas, numpy).  (3 Marks)

b. Load the GlobalElectricityStatistics.csv dataset into a DataFrame. named electricity_data.  (3 Marks)

c. Display and check the first and last five rows of the DataFrame.  (3 Marks)

d. Show the basic information of the DataFrame, including dimensions, column details, data types, and memory usage.  (3 Marks)

e. Handle any missing values or inconsistencies in the data.  (3 Marks)

1.2 Data Visualization (10 marks)

Visualize the electricity generation trends for five selected countries (e.g., China, United States, Ireland, South Africa, India) from 1980 to 2021. Using Python, perform. the following tasks:

a. Plot line charts for each country showing the electricity net generation over the years. (5 Marks)

b. Use appropriate titles, labels, and legends to make the charts readable. (5 Marks)

1.3 Data Aggregation and Summary (15 marks)

Using Python, perform. the following tasks:

a. Calculate and display the average annual electricity generation for each country from 2000 to 2021. (5 marks)

b. Identify and display the country with the highest and lowest average electricity generation during this period. (5 marks)

c. Visualize the comparison of average electricity generation among the selected countries using a bar chart. (5 marks)

Task 2: Predictive Modeling and Discussion (60 marks)

"Net Consumption" refers to the total amount of electricity that is actually used by consumers within a specific area or country over a given period.

Net Consumption is calculated by considering the following components:

Net Generation: The total amount of electricity generated within the area, minus the electricity used by the power plants themselves (e.g., for plant operations).

Imports: The amount of electricity imported from other regions or countries.

Exports: The amount of electricity exported to other regions or countries.

Distribution Losses: The amount of electricity lost during transmission and distribution.

The formula for Net Consumption is:

Net Consumption=Net Generation Imports − Exports − Distribution Losses

2.1 Data Preparation for Prediction (15 marks)

Using Python, perform. the following tasks:

a. Calculate Net Consumption value for each country a country from 1980 to 2021. (5 marks)

b. Select the country with the highest average Net Consumption among all countries from 1980 to 2021. (5 marks)

c. Split the Net Consumption data of selected country into training and testing sets (e.g., 80% training, 20% testing). (5 marks)

2.2 Model Building and Prediction (15 marks)

Using Python, perform. the following tasks:

a. Initialize a suitable predictive model with possible parameters (e.g., linear regression, Naive Bayes). (5 marks)

b. Train the model using the Net Consumption training data of selected country(5 marks)

c. Predict the Net Consumption for the years 2022 to 2024 for the selected country. (5 marks)

2.3 Validation Against Real Data (15 marks)

a. Use internet resources to find the actual Net Consumption data for the selected country for the years 2022 to 2024 with python(5 marks)

b. Compare the predicted values with the actual values. Calculate the percentage error for each year with python(5 marks)

c. Discuss possible reasons for any discrepancies between the predicted and actual values within 200 words(5 marks)

2.4 Analysis and Conclusion (15 marks)

a. Summarize the findings from the predictive modeling and validation within 150 words(5 marks)

b. Provide insights on how big data analytics can be applied to improve electricity generation planning and management within 150 words(5 marks)

c. Provide insights on how big data analytics can be applied to improve other similar scenarios within 150 words. (5 marks)

Submission Format Instructions

The assignment must be typed, spell-checked, referenced, and submitted via Learning Mall Online to the correct dropbox.

Only electronic submissions are accepted - no hard copies:

l A Student_ID.pdf file contains a cover letter with your ID information, and all the task report content.

All students must download their file and check that it is viewable after submission. Document uploads may become corrupted during the uploading process (e.g., due to slow internet connections). Therefore, students themselves are responsible for submitting a functional and correct file that needs to be tested after submitting it.

Overall Marking Criteria

Code Quality and Implementation Results

Outstanding (100%): Code is exceptionally well-organized, readable, and well-commented. Implementation results are accurate and demonstrate a deep understanding of the concepts. All tasks are completed with high precision.

Appropriate (80%): Code is generally well-organized and readable. Implementation results are accurate and meet the requirements. Most tasks are completed effectively.

Needs Improvement (60%): Code is somewhat disorganized or poorly commented. Implementation results are mostly accurate but may have minor errors. Some tasks are incomplete or not fully addressed.

Hard to Understand (40%): Code is difficult to follow or lacks clarity. Implementation results are inaccurate or incomplete. Many tasks are not fully addressed or have significant errors.

No Submission or Missing Section (0%): No submission or critical sections of the assignment are missing.

Data Processing and Analysis (Task 1)

Outstanding (100%): Data reading, preprocessing, visualization, and aggregation are performed flawlessly. Results are presented clearly and accurately. All subtasks are completed with high precision.

Appropriate (80%): Data processing and analysis are generally well-executed. Results are mostly accurate and meet the requirements. Most subtasks are completed effectively.

Needs Improvement (60%): Data processing and analysis show some inaccuracies or inconsistencies. Results are partially accurate but may have minor errors. Some subtasks are incomplete.

Hard to Understand (40%): Data processing and analysis are poorly executed. Results are inaccurate or incomplete. Many subtasks are not fully addressed or have significant errors.

No Submission or Missing Section (0): No submission or critical sections of the assignment are missing.

Predictive Modeling and Discussion (Task 2)

Outstanding (100%): Predictive modeling and validation are performed with high precision. Model choice is well-justified, training is accurate, and predictions are reliable. All subtasks are completed with high precision.

Appropriate (80%): Predictive modeling and validation are generally well-executed. Model choice is justified, training is accurate, and predictions are mostly reliable. Most subtasks are completed effectively.

Needs Improvement (60%): Predictive modeling and validation show some inaccuracies or inconsistencies. Model choice may not be fully justified, training may have minor errors, and predictions may be less reliable.

Hard to Understand (40%): Predictive modeling and validation are poorly executed. Model choice is unclear, training is inaccurate, and predictions are unreliable.

No Submission or Missing Section (0%): No submission or critical sections of the assignment are missing.

Analysis and Conclusion

Outstanding (100%): Analysis is thorough and insightful. Conclusions are well-supported by the data and results. Insights are relevant and demonstrate a deep understanding of the topic. Summary and discussion are concise and clear.

Appropriate (80%): Analysis is generally thorough. Conclusions are supported by the data and results. Insights are relevant and demonstrate a good understanding of the topic. Summary and discussion are clear.

Needs Improvement (60%): Analysis is somewhat superficial. Conclusions may lack full support from the data. Insights are partially relevant. Summary and discussion may lack clarity.

Hard to Understand (40%): Analysis is incomplete or unclear. Conclusions are not well-supported by the data. Insights are irrelevant or unclear. Summary and discussion are difficult to understand.

No Submission or Missing Section (0%): No submission or critical sections of the assignment are missing.


热门主题

课程名

mktg2509 csci 2600 38170 lng302 csse3010 phas3226 77938 arch1162 engn4536/engn6536 acx5903 comp151101 phl245 cse12 comp9312 stat3016/6016 phas0038 comp2140 6qqmb312 xjco3011 rest0005 ematm0051 5qqmn219 lubs5062m eee8155 cege0100 eap033 artd1109 mat246 etc3430 ecmm462 mis102 inft6800 ddes9903 comp6521 comp9517 comp3331/9331 comp4337 comp6008 comp9414 bu.231.790.81 man00150m csb352h math1041 eengm4100 isys1002 08 6057cem mktg3504 mthm036 mtrx1701 mth3241 eeee3086 cmp-7038b cmp-7000a ints4010 econ2151 infs5710 fins5516 fin3309 fins5510 gsoe9340 math2007 math2036 soee5010 mark3088 infs3605 elec9714 comp2271 ma214 comp2211 infs3604 600426 sit254 acct3091 bbt405 msin0116 com107/com113 mark5826 sit120 comp9021 eco2101 eeen40700 cs253 ece3114 ecmm447 chns3000 math377 itd102 comp9444 comp(2041|9044) econ0060 econ7230 mgt001371 ecs-323 cs6250 mgdi60012 mdia2012 comm221001 comm5000 ma1008 engl642 econ241 com333 math367 mis201 nbs-7041x meek16104 econ2003 comm1190 mbas902 comp-1027 dpst1091 comp7315 eppd1033 m06 ee3025 msci231 bb113/bbs1063 fc709 comp3425 comp9417 econ42915 cb9101 math1102e chme0017 fc307 mkt60104 5522usst litr1-uc6201.200 ee1102 cosc2803 math39512 omp9727 int2067/int5051 bsb151 mgt253 fc021 babs2202 mis2002s phya21 18-213 cege0012 mdia1002 math38032 mech5125 07 cisc102 mgx3110 cs240 11175 fin3020s eco3420 ictten622 comp9727 cpt111 de114102d mgm320h5s bafi1019 math21112 efim20036 mn-3503 fins5568 110.807 bcpm000028 info6030 bma0092 bcpm0054 math20212 ce335 cs365 cenv6141 ftec5580 math2010 ec3450 comm1170 ecmt1010 csci-ua.0480-003 econ12-200 ib3960 ectb60h3f cs247—assignment tk3163 ics3u ib3j80 comp20008 comp9334 eppd1063 acct2343 cct109 isys1055/3412 math350-real math2014 eec180 stat141b econ2101 msinm014/msing014/msing014b fit2004 comp643 bu1002 cm2030
联系我们
EMail: 99515681@qq.com
QQ: 99515681
留学生作业帮-留学生的知心伴侣!
工作时间:08:00-21:00
python代写
微信客服:codinghelp
站长地图