You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: CONTRIBUTING.md
+2-4Lines changed: 2 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -9,14 +9,12 @@ We appreciate your contributions!
9
9
5. Create new Pull Request
10
10
11
11
## Contribution Ideas
12
-
-**Remove necessity for `pip install .`**: I think by uploading packages to PyPi we can reduce the installation code steps by consolidating `pip install -r requirements.txt`and `pip install .`. If that's possible that'd be great.
12
+
-**Develop an Automated End-to-End Testing System**: Build an automated testing framework that can be run before merging PRs to `main` to confirm no test cases broke. An example of such a test case would be "go to google docs and write a poem". This testing system should be flexible to add new test cases in the future and reduce the time spent on manually testing each PR.
13
13
-**Improve performance by finding optimal screenshot grid**: A primary element of the framework is that it overlays a percentage grid on the screenshot which GPT-4v uses to estimate click locations. If someone is able to find the optimal grid and some evaluation metrics to confirm it is an improvement on the current method then we will merge that PR.
14
14
-**Improve the `SUMMARY_PROMPT`**
15
-
-**Create an evaluation system**
16
15
-**Improve Linux and Windows compatibility**: There are still some issues with Linux and Windows compatibility. PRs to fix the issues are encouraged.
17
-
-**Enabling New Mouse Capabilities**: (drag, hover, etc.)
18
16
-**Adding New Multimodal Models**: Integration of new multimodal models is welcomed. If you have a specific model in mind that you believe would be a valuable addition, please feel free to integrate it and submit a PR.
19
-
-**Framework Architecture Improvements**: Think you can enhance the framework architecture described in the intro? We welcome suggestions and PRs.
17
+
-**Iterate `--accurate` flag functionality**: Look at https://github.com/OthersideAI/self-operating-computer/pull/57 for previous iteration
20
18
21
19
## Guidelines
22
20
This will primarily be a [Software 2.0](https://karpathy.medium.com/software-2-0-a64152b37c35) project. For this reason:
Copy file name to clipboardExpand all lines: README.md
+7-7Lines changed: 7 additions & 7 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -63,23 +63,23 @@ python3 -m venv venv
63
63
```
64
64
source venv/bin/activate
65
65
```
66
-
6.**Install Project Requirements and Command-Line Interface**:
66
+
5.**Install Project Requirements and Command-Line Interface: Instead of using `pip install .`, you can now install the project directly from PyPI with:**
67
67
```
68
-
pip install .
68
+
pip install self-operating-computer
69
69
```
70
-
7.**Then rename the `.example.env` file to `.env` so that you can save your OpenAI key in it.**
70
+
6.**Then rename the `.example.env` file to `.env` so that you can save your OpenAI key in it.**
71
71
```
72
72
mv .example.env .env
73
73
```
74
-
8.**Add your Open AI key to your new `.env` file. If you don't have one, you can obtain an OpenAI key [here](https://platform.openai.com/account/api-keys)**:
74
+
7.**Add your Open AI key to your new `.env` file. If you don't have one, you can obtain an OpenAI key [here](https://platform.openai.com/account/api-keys)**:
75
75
```
76
76
OPENAI_API_KEY='your-key-here'
77
77
```
78
-
9.**Run it**!
78
+
8.**Run it**!
79
79
```
80
80
operate
81
81
```
82
-
10.**Final Step**: As a last step, the Terminal app will ask for permission for "Screen Recording" and "Accessibility" in the "Security & Privacy" page of Mac's "System Preferences".
82
+
9.**Final Step**: As a last step, the Terminal app will ask for permission for "Screen Recording" and "Accessibility" in the "Security & Privacy" page of Mac's "System Preferences".
@@ -132,4 +132,4 @@ Stay updated with the latest developments:
132
132
133
133
## OpenAI Rate Limiting Note
134
134
The ```gpt-4-vision-preview``` model is required. To unlock access to this model, your account needs to spend at least \$5 in API credits. Pre-paying for these credits will unlock access if you haven't already spent the minimum \$5.
135
-
Learn more **[here](https://platform.openai.com/docs/guides/rate-limits?context=tier-one)**
135
+
Learn more **[here](https://platform.openai.com/docs/guides/rate-limits?context=tier-one)**
0 commit comments