如何在Python中读取混合性质的字符串

huangapple go评论117阅读模式
英文:

How to read a string with mixed nature in python

问题

I have taken some data from an instrument. These data contain the temperature with time. A part of the data looks like,

  1. 00:00:08
  2. section A 76.21
  3. section B 76.54
  4. section C 76.66
  5. section D 77.87
  6. 00:15:30
  7. section A 76.23
  8. section B 76.89
  9. section C 77.98
  10. section D 78.60

and the file continues till the end.

00:00:08 means 00 hour, 00 minutes, and 08 seconds.

Now I need to read this data and make a plot of the temperature with time.

My code until now is as follows,

  1. import matplotlib.pyplot as plt
  2. import matplotlib.cm as cm
  3. from matplotlib.dates import DateFormatter
  4. from datetime import datetime
  5. import re
  6. def main():
  7. file = r"C:\Users\sarad\ss1.dat.log"
  8. index = []
  9. month = []
  10. day = []
  11. year = []
  12. hour = []
  13. minute = []
  14. second = []
  15. s1 = []
  16. s2 = []
  17. s3 = []
  18. s4 = []
  19. s5 = []
  20. s6 = []
  21. s7 = []
  22. s8 = []
  23. with open(file) as line:
  24. while True:
  25. result = re.findall(r"(\d+(?:.\d+){0,1})", line.read())
  26. if not result:
  27. break
  28. hour.append(int(result[0]))
  29. minute.append(int(result[1]))
  30. second.append(int(result[2]))
  31. s1.append(float(result[3]))
  32. s2.append(float(result[4]))
  33. s3.append(float(result[5]))
  34. s4.append(float(result[6]))
  35. s5.append(float(result[7]))
  36. s6.append(float(result[8]))
  37. s7.append(float(result[9]))
  38. s8.append(float(result[10]))

I have to use a loop somehow and read all the data.

Besides that, I need to plot the data vs. time. How do I do that in code? I tried with the datetime function, but that is not working.

Could you give me some ideas?

Thank you.

I described it in the previous section.
The Python version is 3.11.

英文:

I have taken some data from a instrument. These data contain the temperature with time. A part of the data looks like,

  1. 00:00:08
  2. section A 76.21
  3. section B 76.54
  4. section C 76.66
  5. section D 77.87
  6. 00:15:30
  7. section A 76.23
  8. section B 76.89
  9. section C 77.98
  10. section D 78.60

and the file continues till end.
00:00:08 means 00 hour, 00 minutes, and 08 seconds.

Now I need to read this data and make a plot of the temperature with the time.

My code till now is following,

  1. import matplotlib.cm as cm
  2. from matplotlib.dates import DateFormatter
  3. from datetime import datetime
  4. import re
  5. def main():
  6. file = r"C:\Users\sarad\ss1.dat.log"
  7. index =[]
  8. month =[]
  9. day =[]
  10. year =[]
  11. hour =[]
  12. minute =[]
  13. second =[]
  14. s1 =[]
  15. s2 =[]
  16. s3 =[]
  17. s4 =[]
  18. s5 =[]
  19. s6 =[]
  20. s7 =[]
  21. s8 =[]
  22. with open(file) as line:
  23. while True:
  24. result=re.findall(r"(\d+(?:.\d+){0,1})",line.read())
  25. if not result:
  26. break
  27. hour.append(int((result[0])))
  28. minute.append(int((result[1])))
  29. second.append(int((result[2])))
  30. s1.append(float(result[3]))
  31. s2.append(float(result[4]))
  32. s3.append(float(result[5]))
  33. s4.append(float(result[6]))
  34. s5.append(float(result[7]))
  35. s6.append(float(result[8]))
  36. s7.append(float(result[9]))
  37. s8.append(float(result[10])) ```
  38. I have to use loop somehow and read all the datas .
  39. Beside that i need to plot the datas vs time . How do i do that in code ? I tried with datetime function but that is not working.
  40. Could you give some idea ?
  41. Thank you.
  42. I described it in the previous section.
  43. The python is 3.11 version.
  44. </details>
  45. # 答案1
  46. **得分**: 1
  47. Here's the translated code part:
  48. ```python
  49. 你的文本结构看起来非常良好。解析它不应该是问题。以下是我会如何处理它的方式。
  50. import pandas as pd
  51. import matplotlib.pyplot as plt
  52. from collections import defaultdict
  53. text = '''
  54. 00:00:08
  55. section A 76.21
  56. section B 76.54
  57. section C 76.66
  58. section D 77.87
  59. 00:15:30
  60. section A 76.23
  61. section B 76.89
  62. section C 77.98
  63. section D 78.60
  64. '''
  65. table = defaultdict(dict)
  66. for line in text.split('\n'):
  67. if line:
  68. entry = line.strip()
  69. if ':' in entry:
  70. t = entry
  71. else:
  72. _, sample, data = entry.split()
  73. table[t].update({sample:float(data)})
  74. print(table)
  75. df = pd.DataFrame(table).T
  76. print(df)
  77. df.plot()
  78. # 设置刻度标签和旋转
  79. plt.xticks(
  80. ticks=range(len(df)),
  81. labels=df.index,
  82. rotation=45
  83. )
  84. plt.show()
  85. ```
  86. Please note that I've only translated the code part, as requested.
  87. <details>
  88. <summary>英文:</summary>
  89. Your text looks very well structured. It shouldn&#39;t be a problem to parse it. Here is how I would do it.
  90. import pandas as pd
  91. import matplotlib.pyplot as plt
  92. from collections import defaultdict
  93. text = &#39;&#39;&#39;
  94. 00:00:08
  95. section A 76.21
  96. section B 76.54
  97. section C 76.66
  98. section D 77.87
  99. 00:15:30
  100. section A 76.23
  101. section B 76.89
  102. section C 77.98
  103. section D 78.60
  104. &#39;&#39;&#39;
  105. table = defaultdict(dict)
  106. for line in text.split(&#39;\n&#39;):
  107. if line:
  108. entry = line.strip()
  109. if &#39;:&#39; in entry:
  110. t = entry
  111. else:
  112. _, sample, data = entry.split()
  113. table[t].update({sample:float(data)})
  114. print(table)
  115. #defaultdict(&lt;class &#39;dict&#39;&gt;, {&#39;00:00:08&#39;: {&#39;A&#39;: 76.21, &#39;B&#39;: 76.54, &#39;C&#39;: 76.66, &#39;D&#39;: 77.87}, &#39;00:15:30&#39;: {&#39;A&#39;: 76.23, &#39;B&#39;: 76.89, &#39;C&#39;: 77.98, &#39;D&#39;: 78.6}})
  116. df = pd.DataFrame(table).T
  117. print(df)
  118. # A B C D
  119. #00:00:08 76.21 76.54 76.66 77.87
  120. #00:15:30 76.23 76.89 77.98 78.60
  121. df.plot()
  122. #set ticks labels and rotation
  123. plt.xticks(
  124. ticks = range(len(df)),
  125. labels = df.index,
  126. rotation = 45
  127. )
  128. plt.show()
  129. [![enter image description here][1]][1]
  130. To read from file:
  131. file_name = &#39;myfile.txt&#39;
  132. with open(file_name, &#39;r&#39;) as f:
  133. for line in f:
  134. entry = line.strip():
  135. if entry:
  136. if line.strip():
  137. if &#39;:&#39; in entry:
  138. t = entry
  139. else:
  140. _, sample, data = entry.split()
  141. table[t].update({sample:float(data)})
  142. ...
  143. [1]: https://i.stack.imgur.com/PHQF9.png
  144. </details>

huangapple
  • 本文由 发表于 2023年7月6日 18:25:00
  • 转载请务必保留本文链接:https://go.coder-hub.com/76627850.html
匿名

发表评论

匿名网友

:?: :razz: :sad: :evil: :!: :smile: :oops: :grin: :eek: :shock: :???: :cool: :lol: :mad: :twisted: :roll: :wink: :idea: :arrow: :neutral: :cry: :mrgreen:

确定